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ABSTRACT 

Reflecting ■ interdisciplinary emphasis that 
reading comprehension has received during the past decade, the 
articles in this volume deal with both the processes involved in 
reading and the instructional practices used in teaching it-^ The six 
ar'ticles 'devoted 'to reading processes deal specifically with the 
following topics: schemata r comprehension -of text stractares,r 
vocabulary knowledae, the social context of learning to read^ and 
sociai-psycholoaical perceptions and reading comprehension. The six 
articles concei-ning instructional practices report on instructional 
variables in reading comprehension academic learn-ing time and 
reading achievement, the role of reading in bilingual contexts, 
characteristics of exemplary reading programs, overcoming educational 
disadvantages, and recognising reading comprehension programs,. 
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Foreword 

The chapters in this excellent volume reflect . the strong 
interdisciplinary emphasis which reading comprehension has 
received during the past decade. The contribution of the 
cognitive scientist through ; :hjma theory, text structure, 
inferential reasoning, and word meaning clearly in evidence. 
These efforts extend far beyond basic work of the past which 
focused heavily on the word and sentence isolated from natural 
language text. The impact of sociolinguistics is felt through the 
examination of instructional interaction in the language environ- 
ment of the classroom. The importance of reading in bilingual 
education is not neglected in this collection and forecasts the 
great need for research in this area in the immediate years ahead. 

A continued challenge is present in the reading field to 
bridge the chasm between understanding of the reading process 
and instructional practice. This area is ■ not left untended. 
Emphasis is given to the importance of instructional time m the 
learning setting, to characteristics of exemplary reading pro- 
• grams, knd to the impact of reading instruction spawned during 
a period of social unrest emphasizing the equality of educational 

opportunity. , , 

This volume demonstrates the continued dedication otthc 
International Reading Association to the improvement of 
literacy. John Guthrie and the contributors to this work are to be 
commended on their interdisciplinary and applied focus 
illuminating both reading process and practice. 

Robert B. Ruddell 
University of California at Berkeley 
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Introduction 



During the 1970s, the scope of research on reading broadened. It 
now requires a larger definition to encompass studies that are 
designed to be about reading. A wider array of investigators seek 
to illuminate the many phenomena that characterize this human 
ability in its proficient stage, lU its acquisition. There have always 
been investigators who use reading as a medium. Studies are 
often conducted on visual perception, language comprehension, 
persuasion, or consumerism, in which reading effectively serves 
as a tool. Although much can be learned about reading from this 
latter kind of research, the information is often inaccessible or 
buried through underinterpretation. It is the former collection 
that has broadened in definition and increased in number during 
the past decade. 

To illustrate the research on reading that was prominent 
in the beginning of the 1970s, the annual summaries of reading, 
published in the Reading Research Quarterly in winter 1 970, may 
be used. In that issue, 416 reports of reading research -e 
compiled under William S. Gray's classic categories of the 
sociology, physiology, psychology, and teaching of reading. 
Within the psychology of reading, a preponderance of studies 
was on cognitive processes, although studies of language, 
personality, and sex differences made a nominal showing. The 
majority of published research papers on cognitive processes 
\ was focused on visual perception, auditory processes, and visual- 
\auditory integration. The word, as opposed to the phrase, the 
paragraph, or the story, was used as the unit of analysis. 
Characteristics of words, such as their frequency of occurrence in 
written materials or the concreteness of nouns, were analyzed; 
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and jxiircc! associate Icai ning was a favorite paradigm for 
studying their case of acquisition. Rcseareli on these cognitive 
variables comprised 64 percent ol the research on the psychology 
of reading. The only foreshadow of research to come in the'7()s 
was the work of Lawrence Frase on questions and memory for 
text which was published mostly in the Journal of Educational 
Psycholoi^y and regarded at the time as intriguing but somehow 
beyond the pale of reading. 

Also in 1970, the teaching of reading attracted the 
attention of educational researchers. Although a few fruitless 
statements about the status of reading instruction were being 
made (e.g., how many remedial reading teachers are located in 
certain districts of a state) and some forays into reading readiness 
were made, the bulk of the investigations pertained to methods of 
instruction. Of 25 studies on the teaching of reading, the 
comparison of allegedly different methods represents 80 percent 
of the group. This was the era of contrasting of initial teaching 
alphabet (i.t.a.) with conventional instruction; of visual per- 
ceptual training compared to no control; and basal instruction 
versus basai plus phonies. What typified these studies was the 
comparison of one or more series of tasks that were given to 
children to facilitate the acquisition of word recognition or 
reading comprehension ability. The experiments consisted of 
altering cognitive tasks or their sequences to examine the effect 
on reading achievement. Although a study by Labov on the 
relationship between reading achievement and school-related 
values of black adolescents was included in the annual summaries 
of 1970, this article WaS regarded as a contribution to sociology, 
rather than to our understanding of reading. 

By 1980, the cognitive processes in reading underactive 
investigation were expanded to include the comprehension of 
story structures, integration of sentences, drawing inferences, 
testing hypotheses, relating background knowledge to textual 
information, and reading as a process of information search. To 
accommodate a veritable explosion in these areas, several 
journals were founded, including Cognitive Psychology. Dis- 
course Processes, and Cognitive Science, Some other periodicals 
have undergone a substantial reorientation. These veins are 



represented in the present volume in the ehapters of l)a\icl 
Riinielhart, Tom I'rabasso, David Pearson, and Richard 
Anderson, who are not merely spokesmen but the agents of 
expansion in these topics. 

It has been increasingly reeogni/ed that, although reading 
is a cognitive opcrafio*;, learning to read is a social act. Since 
classrooms contain a large number of children who are often 
grouped into sections with different roles, responsibilities, 
expectations, activities, and relationships to the teacher, the 
social dimension of classrooms \s"-coming to be a topic for close 
observation-.7).'nvesligators who see classrooms as microsocietics 
and who use the observational tools of anthropologists have 
introduced some provocative concepts and pcrspecti/es to 
reading educaticn. These are partially expressed in the present 
volume by Ca/den and less directly by Walberg and his 
coauthors. 

In studies ol classroom practice for reading education, the 
boundaries have likewise been extended. Teaching reading has 
been redrawn to include not only a comparison between methods 
(which is here represented by Singer and Balow in a description 
of the Follow Through experiment), but also several independent 
strategies of investigation. One of these lines, summarized by 
Jenkins and Pany, consists of attempts to increase the proficiency 
with which children perform cognitive processes that are found 
to be related to reading ability in basic research efforts. A second 
type of study is focused on exemplary reading programs. Here 
summarized by Samuels, these, investigations locate reading 
education programs known to'be extraordinary for producing 
achievement and attempt to describe them as fully as possible. 
The expectation is that such descriptions may enable other 
educators to duplicate the programs elsewhere. Third, classroom 
practice in reading instruction has come to be viewed as a 
complex organization in which the analysis of time is essential. 
Berliner gives a summary of a common viewpoint about the 
importance of understanding and fostering the optimal use of 
time by teachers and pupils in the classroom to improve reading 
achievement. 

A sobering set of problems in reading education stems 
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from children whose hinguagcs or dialects are different from the 
^'standard" used in schools. Although few coherent studies have 
addressed this problem within the United States, l:lley reports 
that for children of the South Pacific, proHciency in oral 
language and reading can be best accomplished ihroujih 
simultaneous teaching toward an integrated set ol* goals. 

What this volume intends to portray is the new 
topography in research on the processes and teaching ofreading. 
To obtain the breadth of vision attempted here in a single 
V :ic, one necessarily reduces the clarity of focus. The findings 
ol this book will provoke, not preclude, your verification; and the 
methodologies invite a cross-examination. Nevertheless, the 
authors of these chapters arc to be commended for their openings 
into what we may come to see about reading. 

JOHN I'. (a rnRH-; 



Processes in Reading 



Schemata: The Building Blocks of Cognition 

David E. Rumclhari 

University of California at San Dii\i^o 

The notion of a schema and the related notions of frames, scripts, 
plans, etc., have been emphasized in Cognitive Science since the 
mid-1970s (cf. Bobrow & Norman, 1975; Chafe, 1976; Fillmore, 
1975; Minsky, 1975; Moore & Newell, 1973; Rumelhart, 1975; 
Schank& Abelson, 1975; Winograd, 1975). It ismy intent, inthis 
paper; to introduce these concepts to readers unfamiliar with 
them and to show why so much attention has been paid to them. 
These v:.nous terms have been used by different authors to refer 
to any of a set of interrelated concepts. These terms are not all 
synonymous. Different authors have different things in mind 
when they use the different terms. Nevertheless, the various 
concepts are closely enough related that a discussion of any one 
of them will serve as an'iiitroduction to the others. 1 will thus 
focus my discussion on the one I know best, schemata (the 
singular is schema), as developed in Rumelhart and Ortony 
(1977). 

* ■ The term schema comes into psychology most directly 
from Bartlett (1932). Bartlett himself attributes his use of the 
term to Head (1926). However, it would appear that Kant's 
(1787) use of ti. j term already anticipated its major conceptual 
content. The ohd gives the following definition of the term: 

In Kant: Any one of certain (orms of rules of'^produetive imagination" 
through which the understanding'is able to apply its '•categories" to the 
manifold of sense-perception in the process of realizing knowledge or 
experience. 

Adapted from a paper by the same title.which appears in Spiro. Bruce, and Brewer (Eds.). 
Theoretical Issues in Reading Comprehension. Reprinted by permission of l.awrcncc 
Erlbaum Associates. Inc. 
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SoniL* lurtlicr discussiDU ol Kant's \ ic\v is ui\ cn in KuiucllKirt arul 
Ortony (1977). It is because of iliis liislorical prccctlcncc: that 1 
have chosen to retain ihe term schema. 

F-or all ol the authors menlioned aht)ve, schemata truly 
are the huiUiiuj^ blocks of Ci>y:)uti(UL The\' are the iunclamentai 
elements upon which all in format iiMi proccssinii depends. 
Schemata are employed in the process of irUcrprcting sensory 
data (both linimistie and nonlinguisiic). in retrieving information^ 
from memory, in organizing actions, in determining goals and 
subgoals. in allocating resources and generally in guiding the 
flow of processing in the system. Clearly, any de\-icc capable ofall 
these wondrous things must be powerful indeed. Moreo\'er. since 
our understanding of none of these tasks which schemata are 
suj^poscd to carry '.)Ut has reached maturity, it is little wonder 
that a definitive explication of schemata does not yet exist and 
that skeptics "iew theories basctl on them with some suspicion. In 
this paper. 1 hope to spell out. as clearly as pt)ssible. the nature of 
schemata and the kuids of problems they were devised to solve. In 
addition. I hope to present a convincing case that indeed the 
framework pro\'ided by schemata and allied concepts docs, in 
fact, form the basis for a reasonable theory t^f human 
information processing. 

My discussion through the next se\eral sections ol the 
paper will be abstract. Although 1 will not make direct 
application t^f these concepts to a thcc^ry of reading until near the 
end of the paper, many of the papers in this volume will illustrate 
the ways in which schemata can lead to insightful analyses of the 
reading process. 

What Is a Schema? 

A schema theory is basically a theory about knowledge. It 
is a theory about how knowledge is represented and about how 
that representation facilitates the use of the knowledge in 
particular ways. A -cording to "schema theories" all knowledge is 
packaged into units. These units are the schemata. Embedded in 
these packets of knowledge is. in addition to the knowledge itself, 
information about how this knowledge is to be used. 
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'A schema, then, is a data structure for representing the 
generic concepts stored in memory. There are schemata 
representing our knowledge about all concepts: those underlying 
objects, situations, events, sequences of events, actions and 
sequences of actions. A schema .contains, as part of its 
specification, the network of interrelations that is believed to 
normally hold among the constituents of the concept in question. 
A scheipia theory embodies a prototype theory of meaning. That 
is, inasmuch as a'schema underlying a concept stored in memory 
corresponds To the meaning of that concept, meanings are 
encoded in terms of the typical or normal situations or eA-enis 
which instantiate that concept. 

Rather than attempting a formal description of schemata 
and their characteristics at this point, I will turn instead to some 
useful analogies in hopes of giving the reader a more concrete 
notion of the nature ofschemataasi understand them. I will turn 
first to one of the more fruitful analogies, that of a play. 

Schemata Are Like Plays 

The internal structure of a schema corresponds, in many 
ways, to ihe script of a play. Just as a play has characters which 
can be played by different actors at different times without 
changing the essential nature of the play, so a schema Has 
variables which can be associated -with (bound to) different 
aspects of the environment on different instantiations of the 
schema. As an example, consider the schema for the concept hu\\ 
One can imagine a playwright having written a most mlindane 
play in which the entire play consisted of one person purchasing 
some object from another person. At minimum, such a play must 
have two people, some merchandise, and some medium of 
exchange. Whatever else happens, at the outset of the play one 
character (call him/ herthe purchaser) must possess the medium 
of exchange (call it the money)/ The second person, the seller 
must possess the object in question, the mercflandlse. Then, by 
some interaction (bargaining) a bargain is struck and the seller 
agrees to give the merchandise to the purch.aser in exchange for 
a quantity of the money. There would, of course, be many wayr. of 



The Building Blocks of Co^^nition 



playing this little play. The merchandisk could vary from a 
trinket of little value to an object of incalculable worth. The 
SELLHR and the purciiasi-r could vary in status, occupation, sex, 
nationality, age, etc.; "the money could vai'y in amount, and 
whether it was actually money or clam shells'; and the 
BARGAINING could vary in form. Still, through all of this 
variation, as long as the fundamental plot remained the same wc 
could say. that the buy play was being performed. 

Now, this little play is very much like the schema that I 
believe underlies our understanding of the concept buy or that for 
selL There are variables corresponding to the characters in the 
play. We have the purchaser, the seller, the money, the 
MERCHANDISE and the • BARGAINING. When wc understand a 
situation to be a case of buying, we come to associate persons, 
objects, and subevents with the variables of our schema. Upon 
having made these associations, we can determine to what degree 
the situation wc are observing corresponds to this /^ro/ony;^' case 
of bi:ying. 

Just as a playwright often specifies characteristics of the 
characters in his play (age, sex, disposition, etc.) so, too, as part 
of the specification of a schema, we have associated knowledge 
about the variables of the schema. We know, for example, that 
the PURCHASER and seller are normally people and that the 
money is normally money. Moreover, wc know that the value of 
Thc^^MONEY in question will co-vary with the value of the 
MERCHANDISE, ctc. Such knowledge about the typical values of 
the variables and their interrelationships is called the variable 
amstraiJit:'. 

"I'hcse constraints serve two important functions in a 
schema theory. In the first place, variable constraints help in the 
identification of the various aspects of the situation with the 
variables of the schema. If we knqlv that we are observii a case 
of BUYING, we are not going to map the purchaser variable into 
the object in the world which should serve as the money. We 
know this, in part, because we know that the purchaser is 
normally an animate being whereas the money is normally 
money or some other inanin ite object. In the second place. 
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variable constraints can help by serving as default values (cf. 
Minsky, 1975) or initial "guesses'' for variables whose values we 
have not yet observed. Thus, for example, if we take a certain 
transaction to be one of buying, bur do not notice the money, we 
can infer that there was money and that, in fact, the monky 
probably was money amounting iri value to about the value of the 
merchandise In this way, ' the schema can help us make 
inferences about unobserved aspects of a situation. 

It is perhaps useful to note here thai variable constraints 
offer default values for unobserved conditional on the 

values of the observed variables. M oreover, the constraints are 
not <ilhor-none constraints which require that certain variables 
haA^e aJixed range of values. Rather, they are merely specifica- 
tidlK^the normal range bf value| for each variable and how this 
normal range varies with the specification of various combina- 
tions of other values on the other Variables. Thus, as Rumelhart 
and Ortony (1977) suggest, it is perhaps most useful to think of 
variable constraints as forming a kind of multivariate disiribu- 
Hon with correlations among the several variables. 

There is also the notion of an instantiation of a schema 
which corresponds to an enactment of a play, A play is enacted 
whenever particular actors, speaking particular lines perform at a 
particular time and place. Similarly, a schema is instantiated 
whenever a particular configuration of values is bound to a 
particular configuration of variables at a particular moment in 
time. Interpreting a situation to be an instance of some concept, 
such as an instance of buying, involves, according^ the present 
view, the instantiation of an appropriate schema, say the niiY 
schema, by associating the various variables of the schema with 
the various' aspects of the situation. Such a schema along with its 
variable bindings is called an instantiated schema. Just as we 
could, say, take a movie of an enactment of a play and thereby 
save for posterity a trace of the enactment, likewise it is the traces 
of our instantiated schemata which serve as the basis of our 
recollections. 

Before leaving the analogy between the script of a play 
and a schema, it is useful to note that neither is a complete 
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specification of every detail — both allow room for irrelevant 
variation and creative interpretation. The script of a play, no 
matter how meticulous the playwright, allows for an infinity of 
variations, each of which can properly be considered an 
enactment of the play. Certain lines composed by the playwright 
are sometimes changed to suit the interpretation of the director. 
Nevertheless, within limits, it is the same play. So it is with 
schemata. A schema is not so rigid!;; applied that no variation is 
allowed. The schema only provides the skeleton around which 
the situation is interpreted. Variations orthogonaLto the specifi- 
cations of the schema have no bearing on the quality v/ith which 
the schema is said to account for the situation. Moreover, even 
minor aspects of the situation which might be considered central 
to the schema can undergo some variation before we completely 
reject the interpretation provided by the schema. 

Finally, despite all of^the ways in which a schema is like a 
play, there are also numerous ways in which a schema is unlike a 
play. Perhaps most important of these is degree of abstraction. I n 
our example of the buv ,schema, we imagined a play that was 
more abstract than any o.ie playwright would ever compose. 
Normally, the playwright v..: >W determine the kirul of buying 
involved, as well as more dc ' bout the characters and more 
constraints on the dialogue. Tr,.; iu;v schema, on the other hand, 
must be applicable to any case of buying and thus must, 
necessarily, be more abstract than any actual play would ever be. 
Moreover, whereas a play is normally about people and their 
actions, a schema may be about events and objects of any sort. 
Indeed, a schema may merely be about the nature of a wholly 
inactive object such as a chair. In this' case, the schema specifies 
not action or event sequences, but rather spatial and functional 
relatir^iships characteristic of chairs. Finally, although a play 
may roniain acts, each with its own structure, a script for a play 
exists really only on one level. A script does not consist of a 
configuration of sub-scripts. A 'schema, on the other hand, 
should be viewed as consisting of a 'configuration of sub- 
schemata corresponding to the constituents of the concept being 
represented. These points will be made clearer in the following 
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sections when I draw analogies between schemata and oih^^ 
familiar concepts. 

Schemata Are Like Theories 

Perhaps the central function of schemata is in the 
construction of an interpretation of an event, object, or 
situation — that is, in the process of comprehension. In all of this, 
it is useful to think of a schema as a kind of informal, private, 
unarticulated theory about the nature of the events, objects, or 
situations which we face. The total set of schemata we have 
available for interpreting our world in a sense constitutes our^ 
private theory of the nature of reality. The total set of schemata 
instantiated at a particular moment in time constitutes our 
internal model of the situation we face at that moment in time. 
Or, in the case of reading a text, a model of the situation is 
depicted by the text. 

Thus, just as the activity surrounding a theory is often 
focused on the evaluation of the theory and the comparison of the 
theory with observations we have made, so it is that the primary 
activity associated with a schema is the determination whether it 
gives an adequate account for some aspect of 'our current 
situation. Just as the determination that a particular theory 
accounts for some observed results involves the determinations 
of the parameters of the theory, so the determination that "a 
particular configuration of schemata accounts for the data 
presently available at our senses requires the determination of the 
values of iht variables of the schemata. If a promising schema 
fails to account for some aspect of a situation, one has the options 
of accepting the schema as adequate in spite of its flawed account 
or of rejecting the schema as inadequate and looking for another 
possibility. Therefore, the fundamental processes of campTrefien- 
sion are taken to be analogous to hypothesis testing, evaluation 
of goodness of fit and parameter estimation. Thus, a reader of a 
text is presumably constantly evaluating hypotheses about the 
most plausible interpretation.of the text. Readers are said to have 
understood the text when they are able to find a configuration of 
hypotheses (schemata) which offer a coherent account for the 
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various aspects of the text. To the degree to which a particular 
reader fails to find such a configuration, the text will appear 
disjointed and uncomprchcnsible. 

Schemata are like theories in another important respect. 
Theories, once they arc moderately successful, become a source 
of predictions about unobserved events. Not ail experiments are 
carried out. Not all possible observations are made. Instead, we 
use our theories to make inferences with some confidence about 
these unobserved events. So \C\s with schemata. Wc need not 
observe all aspects of a situation before we are willing to assume 
t;hat some particular configuration of schemata oHers a 
satisfactory account for that situation. Once we have accepted a 
configuration of schemata, the schemata themselves provide a 
richness which goes far beyond our observations. Upon deciding 
that we have seen an automobile, we assume that it has an engine; 
uw'adlights, and all of the standard characteristics of an 
automobile. We do this without the slightest hesitation. We have 
complete confidence in our little theory. This allows our 
interpretations to far outstrip our sensory observations. In fact, 
once wc have determined that a particular schema accounts for 
some event we may not bd able to determine which aspects of our 
beliefs areM^ascd on direct sensory information and which are 
merely consequences of our interpretation. 

Schemata Are Like Procedures 

There are at least two inadequacies of the analogies 
presented above. I n the first place, plays and theories are passive. 
Schemata are active processes. In the second place, the 
relationship between a theory and its constituent sub-theories or 
between a play and its constituent sub-plays is not always 
evident. Schemata, on the other hand, have a very weil defined 
constituent structure. 

In both of these ways, schemata resemble procedures or 
computer programs. Schemata are active computational devices 
capable of evaluating the quality of their own fit to the available 
data. That is, a schema should be viewed as a procedure whose 
function it is to determine whether, and to what degree, it 

■ 1 o 
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^ accacfnt^kfor the pattern of observations. This includes, among 
other things, associating its variables to the appropriate aspects 
of its envimnment— i.e., binding its own variables. Thus, to the 
degree thaty schemata underlying concepts are identified with 
meaning oftfwse concepts, a schema theory is both 'd prototype 
theory and a\procecIural theory of meaning. Obviously, the 
degree to whlfch a schema theory of human information 
processing can work depends on the degree to which procedures 
can actually b^ojnstructed to carry out the tasks I have just 

* assigned to ttferrT^beiieve they can and will address this issue in 
th J following sections. 

second characteristic which schemata share with 
procedures is a structural one. Procedures normally consist of a 
V network (or a tree) of sub-procedures. A particular procedure 
normally carries out its task by invoking a pattern of sub- 
procedures each of which in turn operates by invoking its sub- 
procedures. Each procedure or sub-procedure can return values 
which can serve as conditions determining which if any further 
sub-procedures are to be invoked. So it is with schemata. A 
schema is a network (or possibly a tree) of sub-schemata each of 
which carries out its assigned task of evaluating its goodness of fit 
whenever activated. These sub-schemata represent the concep- 
tual constituents of the concept being represented. 

Thus, for example, suppose we had a schema for a face. 
This would consist of a certain configuration of sub-schemata 
each representing a different constituent of a face. For example, 
there would presumably be a sub-schema representing the 
MOUTH, one for the nose and one for each har and each fve. 
These sub-schemata in turn would consist of a configuration of 
. constituents. The eye schema, for example, would consist of a 
configuration of sub-schemata including, perhaps, an iris, 

EYELASHES, an EYKBUOW, CtC. 

Just as a procedure uses results produced by its sub- 
procedures to carry out its task, so too a schema uses results 
produced by its sub-schemata to carry out its tasks. As I indicated 
above, the primary a'ctivity pf a schema is the evaluation of its 
goodness of fit. An important mechanism of this evaluation 
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involves the evaluation of the goodness of fit of each of its 
constituent parts. Thus, if a good EYH is found and a good mouth 
is found, the face schema can use this information along with its 
own evaluation of whether the entire configuradon is right tor a 
face to generate an overall evaluation of its goodness of fit. 

To summarize then, just as a procedure consists of sub- 
procedures and those sub-procedures, in turn, consist of more 
sub-procedures, etc., so a schema consists of sub-schemata each 
of which, in turn, is specified as a configuration of its sub- 
schemata, etc. One may be struck by the fact that this process 
must stop somewhere. If each and every .schema were merely a 
configuration of sub-schemata the process would never end. The 
solution to this dilemma for schemata is identical to the solution 
for procedures. When a computer program is written, this 
embedding process does not continue indcfinitch . Ev^-ntually, 
some sub-procedure consists entirely o: a c^niTiguration of 
elementary insiructions for rh:; machine in qucnic-n. Likewise, 
with schemata, there must b-: a ^:ct of schemata which are 
elementary, in the sense that they do not consist of a further 
breakdown in terms of sub-schemala. Such elementary schemata 
correspond t.o what Norman and Rumelhart (1975) .call 
primitives. 

Schemata Are Like Parsers 

A parser is a device which, given a sequence of symbols, 
determines whether that sequence forms a legal sentence 
(according to the rules of some grammar) anS, if it does, 
determines the constituent structure of the sentence. That is, it 
determines which symbols in the sequence correspond to which 
constituents of the sentence. The process of finding and verifying 
appropriate schemata is thus a kind of parsing process which 
woiks with conceptual elements— finding constituents and sub- 
cons-thuents among the dat:: currently impinging on the system in 
much the same way that r. sentence parser must find the proper 
parse for the input string"bf words. 

One particularly useful aspect of this analogy is the 
substantial body of work carried o\\\ in computational linguistics 
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on various parsing procedures. I believe that the processing 
strategies developed for some of the most sophisticated of these 
will carry over nicely in their application to schemata generally. 
As I will discuss below, I have in mind here especially the work of 
Kaplan (1973) and his development of the general syntactic 
processer (gsp). 

Summary of the Major Features of Schemata 

Rumelhart and Ortony(1977) listed four major character- 
istics of schemata. These were: 

1. Schemata have variables. 

2. Schemata can be embedded, one within another. 

.3. Schemata represent knowledge at all levels of abstrac- 
tion. 

4. Schemata represent ^wow/e^/ge rather than definitions. 

The analogies presented above illustrate all of these features. 
Whereas schemata have variables, plays have roles, theories have 
parameters, and procedures have arguments. The embedding 
characteristic of schemata is best illustrated by the analogy 
between schemata and procedures. Schemata consist of sub- 
schemata as procedures consist of sub-procedures. Just as 
theories can be about the grand and the small, so schemata can 
represent knowledge at all levels— from ideologies and cultural 
truths to knowledge about what constitutes an appropriate 
sentence in our language to knowledge about the meaning of a 
particular wr:-d to knowledge about what patterns of excitations 
are associated with what letters of the alphabet. We have 
schemata to represent all levels of our experience, at all levels of 
abstraction.' Finally, our schemata ^re our knowledge. All of our 
generic knowledge is embedded in schemata. 

In addition to these four features, the analogies presented 
here indicate at least two m.ore general features of schemata: 

5. Schemata are active processes. 

6. Schemata are recognition devices whose processing is 
aimed at the evaluation of theirgoodness of fit to the 

data being processed. 
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The Control Structure of Schemata 

Perhaps the central questions in the development of a 
schema based model of perception and co^nprehension are: first, 
how is an adequate configuration of schemata discovered and, 
second, how is the goodness of fit evaluated? These arc largely 
problems of conirol structures. There are many schemata. Not all 
of them can be cvaluated^at once. Somehow, there must be a 
schema for activating just those schemata which are most 
promising. There are two basic sources of activation for 
schemata. These are usually referred to as top-down and boUoin- 
up activation. These two directions correspond to what Bobrow 
and Norman (1975) have called conceptua/ly-clriven and dafa- 
driven processing. 1 turn now to a discussion of these two modes 
ol activation. 

Conceptually-Driven and Data-Driven Processing 

A schema may activate a sub-schema in the way a 
procedure invokes its sub-procedures. This is called conceplually- 
(Iriven processing. In a sense, conceptually-driven processing is 
expectation-driven processing. That is, when a schema is 
activated and it, in turn, activates its sub-schemata, the activation 
of these sub-schemata arc derived from a sort of expectation that 
they will be able to account for some portion of the input data. 
For example, suppose that, through some mechanism, the vacu 
schema is considered a promising account for the 'input and 
thereby activated and set about evaluating its goodness of fit; The 
promise of the fach schema is, in a sense, transferred to its 
MOL'TH, N'OSH, HVH, KAR, etc. *sub-schemata. 

A second mechanism for schema activation is bottom-up 
or data-driven activation. A schema is said to be activated from 
the bottom-up whenever a sub-schema which has been somehow 
activated causes the various schemata of which it is a part to be 
activated. If the activation of the FAcn schema led to the 
activation of the phrson schema, we would say that the activation 
of the PFRSON schema was data-driven. Thus, where concep- 
tually-driven activation goes from whole to part, data-driven 
activation goes from part to whole. In schema directed 
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processing, activation goes in both directions'. ,' 

Schema directed processing is assumed to proceed in 
roughly the following way: Some event ocfcurs at the sensory 
system. The occurence of this event "automatically" activates 
certain "low-Ievel" schemata (such schemata might be called 
feature detectors). These low level schemata would, in turn, 
activate (in a data-driven fashion) certain of the ''higher lever" 
schemata (the most probable ones) of which they are constitu- 
ents. These "higher level" schemata would then initiate 
conceptually-driven processing by activating the sub-schemata 
not already activated in an attempt to evaluate its goodness of fit. 

At some point, when one of these higher level schemata 
began to get further positive results about its goodness of tit (i.e., 
it found evidence for other of its constituents), il would activate 
still higher level schemata which would look for still larger 
constituents. 

This higher, more abstract schema would then activate, 
from the top-down, still other of its constituent schemata; and 
this activation would flow through its sub-schemata back down 
to lower-level schemata which would eventually make contact 
with either other schemata which have been activated from the 
bottom-up or they will initiate a sear^ch for the "predicted" 
sensory inputs. 

Whenever a schema initiates a search for sensory data 
which are not present, that counts as evidence against that 
schema and also as evidence against all of those schemata which 
require the presence of that schema as a constituent sub-schema. 
When sufficient evidence is accumulated agai* ^t a schema, 
processing of that schema is suspended and processing resources 
are allocated to other currently more promising schemata. 
Whenever enough evidence is gained in favor of a schema, that 
schema is taken as an adequate account for the relevant aspect of 
the input and the interpretation offered by that schema is taken as 
the "correct" interpretation of the relevant event. Later 
processing, on other, higher-level schemata may eventually 
disconfirm a temporarily accepted schema and we will have the 
phenomenon of the "double-take." 
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My discussion of the processing system to this point has 
been rather abstract. In the following section I willcxamine, in 
some detail, an example of this mixed initiative processing 
system. 

An Example 

Consider the following brief passage: 

Business had been slow since the oil crisis. Nobody seemed to want 
anything really elegant anymore. Suddenly the door opened and a well 
dressed man entered the showroom Roor. John put on hk friendliest 
and most sincere expression and walked toward the man. 

Although merely a fragment, most people generate a 
rather clear interpretation of this story. Apparently, John is a car 
salesman fallen on hard times. He probably sells rather large, ele- 
gant cars (most likely, Cadillacs). Suddenly a good prospect enters 
the showroom where John works. John wants to make a sale. To 
do that he must make a good impression on the man. Therefore he 
tries to appear friendly and sincere. He also wants to talk to the 
man to deliver his ules pitch. Thus, he makes his vvr^y over to the 
man. Presumably, had the story continued John would have made 
the sales pitch and, if all went well, sold the man a car. 

■ How do people arrive at such an interpretation? Clearly, 
people do not arrive at it all at once. As the sentences are read, 
schemata are activated, evaluated, and refined or discarded. 
When people are asked to describe their various hypotheses as 
hey read through the story, a remarkably consistent pattern of 
hypotheses generation and evaluation emerges. The first sentence 
is usually interpreted to mean that business is slow because of the 
oil crisis. Thus, people are led to see the story as about a business 
which is somehow dependent on oil and is suffering. Frequent 
hypotheses involve either the selling of cars, or of gasoline. A few 
interpret the sentence as beingaboul the economy in general. The 
second sentence, about people not wanting elegant things 
anymore, leads people with the gas station hypothesis into a 
quandary. Elegance just doesn't fit with gas stations. The gas 
station hypothesis is weakened, but not always rejected. On the 
other hand, people with hypotheses about the general economy 
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or about c^ars have no trouble incorporating this sentence into 
their emeij^ing iti,tcrprctaiion. In the former case, they conclude it 
means t|^lit pocpfe don't buy luxury items; and, in the latter, they 
assum./it ryieans that people don't buy large elegant cars- 
Cadillacs much anymore. The third sentence clinches the car 
interpretatlpn for nearly all readers. They are already looking for 
a busiViess interpretation -that most probably means a selling 
interprbtatio.n:~j^j?ind when a well-dressed man enters the door he 
is immediately labeled as someone with money-— a pro- 
spective HrvFK. The phrase showroom floor clearly invalidates 
the gas station interpretation and strongly implicates auto- 
mobilei> which are often sold from a showroom. Moreover, the 
occurrence of a specific event doesn*l fit at all well with the view 
that the passage is a general discussion of the state of the econ- 
omy. Finally, with the introduction of John, we have an ideal 
candidate for the si-lu-r. John's actions are clearly those stereo- 
typic of a salesman. John v^nts to make a sales and his "putting 
on'' is clearly an attempt on his part to ''make a good impression." 
His movement toward the man fits nicely into this interpretation.. 
If he is a salesman, he must make contact with the man and deliv- 
er the stereotypic "pitch." 

Qualitatively, this little account (which was derived from 
an analysis of a number of readers describin-g their current inter- 
pretation of the story after each sentence) fits well with the gener- 
al approach I have been outlining. The process of comprehension 
is very much like the process of constructing a theory, testing it 
against the data currently available, and, as more data.become 
available, specifying the theory further— i.e., refining thV default 
values (as perhaps was the case when. those holding the "car hy- 
pothesis" from the beginning encountered the sentence about 
nobody wanting anything elegant anymore). If the account be- 
comes sufficiently strained, it is given up and a new one con- 
sfructed, or, alternatively, if a new theory presents itself which 
ohx'iously gives a more cogent account, the old one can be 
dropped and the new one accepted. 

But where do these theories come from? The theories are, 
of course, schemata. Presumably, through experience we have 
built up a vast repertoire of such schemata. We have schemata for 



The Building Blocks of Cognition 



17 



salesmen, the kinds of motives they have and the kinds of tech- 
hiques they employ. We have schemata for automobiles, in- 
cluding how and where they are sold. We have built up schemata 
for the '*oil crisis," what kinds of effects it has on what kinds of 
businesses. We have schemata about business people, the kinds 
of motives they have and the kinds of responses they make to 
these motives. The knowledge embedded in Ihcse schemata forms 
the framework for our theories. It is some configuration of these 
schemata which ultimately forms the basis for our understand- 
ing. 

But how does a relevant schema suggest itself? It is here 
that the control strucuncs discussed above play an essential role. 
Presuniably, it is the bottom-up observation that a certain con- 
cept has been referenced that leads to the suggestion of the initial 
hypotheses. The notion that business was slow, suggests sche- 
mata about business and the economy. Since the slowness was 
dated from the occurrence of the oil crisis, it is a natural inference 
that the oil crisis was the cause of the slowness. Thus, a iu:siness 
schema is activated. The particular typk of business is pre- 
sumably a variable which must be filled. Fhe information about 
the oil crisis suggests that it may be an oil related business. Thus, 
readers are led to restrict the TV pf: variable of the nrsi *-'i:ss schema 
to oil related businesses. 

At this point, after the bottom-up activation -f the high 
level lU'siM-ss schema has occurred, this schema would generate a 
top-down acti\'ation of the various possible oil related busi- 
nesses. Prime candidates for these are, of course, automobile re- 
lated businesses. Of these, selling gasoline and automobiles are 
the two most salient possibilities. 

When the second sentence is encountered, an attempt is 
made to fit it into the schemata eui rently considered most prom- 
ising. As i discussed above, this information could serve to fur- 
ther restrict the tyim- variable in the automobile lU'siNHSs schema 
but doesn't fit well with the gasoli: j business schema. 

The Bi'SFNiiSs schema presumably has, as part of its speci- 
fication, a reference to the in;Y or si-M. schema discussed earlier. 
Once activated, these schemata search for potential variable 
bindings. In the ease of the automobile business, the mhrciian- 
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DISH variable is bound to an automobile. The second sentence 
suggests an elegant automobile. When the third sentence is en- 
countered, the reader has not yet found a canclidate for ruyhk or 
.SEij.HR. The sentence about a well-dressed man immediately 
.suggests a potential luiviiu. The phrase ''showroom floor'' offers 
additional bottom-up support for the automobile hypothesis. In 
fact, it is a strong enough clue itself that it can suggest automobile 
sales to a reader who currently considers an alternative schema 
more likely. We thus have a wvyhr and some mhrciiandisi:. The 
well-dressed quality of the ruyhr is consistent with our view that 
the MHRC-irANDisH is elegant and therefore expensive being well- 
dressed suggests MON^v. We need only a siiLUiR ~ i.e.. an 
automobile salesman. Readers probably alrciicly bring a relative- 
h '-omplete characterization oj* the "default value" forcarsales^ 
rr .n. We heed but little additional information to generate 
lather detailed description of his goals and motives. 

In spite of the length of this example, it should be noted 
that I have provided only a sketch of the elaborate processing 
which must occur in the comprehension of even so simple and 
direct a story as this. The problem 'is indeed a complex one and no 
one yet has been able to construct a model capable of actually 
carrying out the tasks involved. The conviction that the concept 
• of the schema is the most promisingVoute to thesolution to these 
problems has led to its current popularity. 

The Major Funcdous of Schemata 

My intent to this point has been primarily definitionaL 1 
have tried to sho\v what schemata are and generalK' how they are, 
supposed to work. In this secti*on I will give a few examples, 
mostly taken from the psychological literature, of phenomena for 
which schemata appear to offer promising accounts. 1 first turn 
to a discu.ssion of pcrccpfion. e necially as it relates to reading. 

Schemata and Perceiving 

There arc numerous examples in the psychological litera- 
ture which suggest a schema-like theory to account for them. I 
will mention just a few examples here, "Perception, like language 
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comprehension, is an interactive process. Information comes in 
from our sense organs, which suggest but do not clctermine 
appropriate schemata for the interpretation of the sense data. It is 
:oftcn only in the context of the whole that the individual parts of 
an object can be identified. Similarly, the whole itself cannot be 
identified apart from its parts. The int^erpretation o^.parts and 
wholes must proceed jointly. Our final interpretation is deter- 
mined both by the local clues and by consistency ^mong the 
various levels of analysis. Consider, as an exarrip,le> Figure 1 
taken from Palmer ( 1975). The object on the left is clearly recog- 
nizable as a face, but its parts (series B) are not recugni/able out 
of context. Thus, u cannot be that we first perceive the parts and 
then construct an interpretation of the whale. Rather, the various 
shapes of the lines sugi^esl, but do not determine, possible inter- 
pretations (the wiggly line suggests a possible nose, the acute 
angle suggests a possible eye, etc.). Lower levnl nosi-: and nvn 
schemata may be activated, which in turn may activate higher 
level schemata such as the fac i: schema. The l u r schema then 
activates schemata for all of the parts of the \\\cv. not receiving 
bottom-up activation. (For example, the lips may not be close 
enough to lips to activate ^his schema at all out ofcontext. Inthis 
case, the lips schema would be activated by the i-ac'L schema and 
find sufficient evidence to serve -in context - to count as lip.s.) 

As can be noted from series C of the Figure, it is not that 
parts of a face cannot ever be recognized without the face as a 
context. But. in order to be recognized out ofcontext, they too 
must have an internal structure. Ifenoughdata is available about 
its internal structure, a schema like the Nosi: schema can serve the 
function of an organizing whole perfectly well. 

There is ample evidence of similar processes in the reading 
process. It is well known, for example, the strings of characters 
which form words are more easily apprehended than strings 
which do not form words. The reason for this presumably stems 
from the fact that we have schemata corresponding to words and 
none for random letter strings. Just as evidence for a nose in- 
directly constitutes evidence for lips through the face schema, so 
too evidence for one letter can constitute evidence for other let- 
ters ' hrough the schema for the word in question, Thus, for ex- 
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A in context 



B out of context 




face 



nose 



eye 



car 



mouth 



Figure 1 

An illustration of part-whole context. Facial features recognizable in 
the contexrof a profile (A) are not recognizable out of context (B), 
When the internal part structure of the facial features is differentiated 
(C), however, the features become recognizable out of context. 

ample, evidence favoring a T in the first position and an E in the 
third position of a three letter word indirectly constitutes evi- 
dence for a H in the second position through activation of the 
THE schema. The use of such information fs presur^ably the 
mechanism whereby words are easier to see than random letter 
strings. Moreover, one of the characteristics which separate 
skilled readers from those with less skill is presumably the avail- 
ability of more, more completely developed word schemata. 

It is interesting that schemata not only contribute tov^ards 
the development of an accurate perception but, by the same to- 
ken they can sometimes cause a distortion. An experiment by 
Bruner and Potter (1964) illustrates the debilitating effect of 
premature commitment to a particular schema. In the study, 
subjects were presented with unfocused slides of familiar objects. 
The slides were*^owly brought into focus. At each step along the 
way, as the slides were brought into focus, subjects were to report 
their best guess as to what the content of the slide was. Under 
these conditions, subjects continued to mis-identify the object 
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long after naive subjects (those started with less severe amounts 
of defocusing) were able to readily identify the objeet in question. 

This resuh is presumably due to the fact that subjects be- 
came committed to their early interpretations of the slide and 
then needed more information to disconfirm their original hy- 
pothesis than is normally rctuiired. 

Schemata and Understanding Discourse 

As discussed above, the process of understanding dis- 
course is the process of finding a configuration of schemata 
which offers an adequate account of the passage inquestion^The 
analysis of the "oil crisis story" given above illustrates generally 
how such a process is supposed to operate. Clues from the story 
suggest possible interpretations (instantiations of schemata) 
which are then evaluated against the successive sentences of the 
story until finally a consistent interpretation is discovered. Some- 
times, a reader fails to correctly understand a passage. There are 
at least three reasons implicit in schema-theory as to why this 
might occur. 1) The reader may not have the appropriate sche- 
mata. In this case he/she simply cannot understand the concept 
being communicated. 2) The reader may have the appropriate 
schemata, but the clues provided by the author may be insuffi- 
cient to suggest them. Here again the reader will not understand 
the text but, with appropriate additional clues, may come to 
understand it. 3) The reader may find a consistent interpreta- 
tion of the text, but may not find the one intended by the author. 
In this case, the reader will "understand" the text, but will 
misunderstand the author. 

There £tre numerous examples of these three phenomena 
in the literature. Perhaps the most interesting studies along these 
lines were carried out by Bransford arid Johnson (1973). They 
studied the comprehension of texts in which subjects lacked the 
appropriate schemata, ones in which the schemata were poten- 
tially available, biit there were not sufficient clues to suggest the 
correct ones as well as ones on which subjects were led to choose 
a "wrong" interpretation. Consider, as an example, the following 
paragraph used in one of the studies. 
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The prol'cdure is actually quite simple. First you arrange things into 
different groups. Of eourse. one pile may be sufficient depending on 
hov/"mueh there is to do. Il you have to go somewhere else due to lack of 
facilities that is the next step"; otherwise you are pretty well set. It is 
irr/portant not to overdo things. That is. it is belter to do too few things 
at once than too many. Fii theshort run this may not seem important but 
complieatioiis ean easily' arise. A mistake can be expensive as well. At 
first the whole procedure will seem eompiicated. Soon, however, it will 
become jusi another facet of life. It isdinicult to foresee any end to the 
necessity for this task in the immediate future, but then one can never 
tell. After the procedure is completed one arranges the materials into 
different groups again. Then they can be put into their appropriate 
places. Eventually they will be used once more, and the whole cycle will 
then have to be repeated. However, that is part of life. [p. 400] 

Most readers find this passage extremely difficult to un- 
derstand. However, once they are told that it is about washing 
clothes, they are able to bring their clothes washing schema to the 
fore and make sense out of the story. The difficulty with this 
passage is thus not that readers don't have the appropriate 
schemata: rather, it stems from the fact that the clues in the story 
never seem to sui^gest the appropriate schemata in the first place. 
The ''bottom-iip'' information is inadequate to initiate the com- 
prehension process appropriately. Gnce the appropriate sche- 
mata are suggested, most people have no trouble understanding 
the text. 

Although most readers simply find the passage incom- 
prehensible, some find alternative schemata to account for it and 
thus render it -comprehensible. Perhaps the most interesting in- 
terpretation I have collected was from a Washington bureaucrat 
who had no difficulty with the passage. He was able to interpret 
the passage as a clear description of his job. He was, in fact, 
surprised to find that it was supposed to be about ''washing 
clothes'"' and not about ''pushing papers.'' Here, then, we have an 
example of the third kind of comprehension failure, '^under- 
standing the story" but ''misunderstanding the author." 

Obviously; a detailed account of the comprehension pro- 
cess requires a detailed description of the schemata readers have 
available, as well as an account' of the conditions under which 
certain of these schema!ta are activated;A number of researchers 
have been developing such specific models of specific schemata 
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(cf, Rumelhart, 1975, 1977; Schank & Abelson, 1975), Other in- 
vestigators have described h6w schemata are used to understand 
narratives (Mandler & Johnson, 1977), expository texts (Meyer, 
1975), and newspaper articles (Thorndyke, 1979). However, 
those instances of the use of schema theory to account forcon'i- 
prehension cannot be considered fully here for lack of space. 

Conclusion 

It was my intent in this paper to give the reader unfamiliar 
with schemata an intuition through which he/she could interpret 
the increasing number of papers employing these conceptual- 
izations. I have aimed for generality rather than specificity in my 
account. 1 have tried to show the many domains to which the con- 
cept of a schema has been applied and the heuristic value of 
thinking about psychological and educational problems in terms 
of schemata. Although the development of schema-based theo- 
ries, such as the ones I mention above, is yet in its infancy and 
these ideas have not yet proved their usefulness, I believe that 
they offer the most promising leads for thos'j of us interested in 
the difficult problems posed when we try to apply psychological 
theories directly to educationally relevant domains. 
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Comprehension of Text Structures 



p. David Pearson 

University of fllhiois at Urhana-Chanipaii^n 
Kaybcth Campcrell 
University of Wisconsin 

In this paper, we will review theoretical and empirical develop- 
ments in the comprehension of text structure over the past twenty 
years. Following that review, we will offer some suggestions 
about what this area of scholarship has to say about educational 
practice. The suggestions will be of two types: a) tentative sug- 
gestions about educational practices that educational publishers 
and/ or teachers ought to consider in preparing texts for students 
and lesson plans to' help them cope with variations in text struc- 
ture, and b) suggestions to educational researchers concerning 
classroom research which seem reasonable in the light of basic 
research about text structure influences. 

' iVhat Would It Mean to Find that 
Text Structure Influences Comprehension? 

Perhaps a good starting point for a review concerned with 
the influence of text structure'on prose comprehension is to ask 
of what consequence any conceivable findings might be. For 
example, suppose our review were to demonstrate that 90 percent 
of the variation in students' comprehension of prose materials 
was due to the influence of variation in text structure. Suppose 
we could demonstrate that b^ holding content (the ideas, 
concepts, and relations among concepts) of a passage constant 
and altering the surface structure in which the ideas are 
communicated we could move a student from 25 percent 
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comprehension of the passage to 75 percent comprehension. 
What would we recommend? Clearly, we would immediately 
inform the publishing industry thai we had made a breakthrough 
in communieation tcehnology and write manuals on how to 
communicate ideas effectively in prose. 

Suppose, alternatively that we found that variation in text 
structure had virtually no effect on comprehension. Sup- 
pose we found, in the hypothetical cxperimetit above, that 
sueh alterations yielded a modest 5 percent instead of a 50 percent 
gain in comprehension of passages. We wouid probably drop our 
heads a little and recommend that our colleagues look elsewhere 
for any answers to the question of how to improve our 
communieation efficiency look at ihc nature of the concepts 
themselves and relations among thcrn. perhaps. 

Suppose that we found that Nariations in text structure 
made a big difference for young students but that the differences 
between various levels of complexity decreased as a function of 
age. We would likely advise our publisher to avoid certain gram- 
matical structures or text organization patterns until some opti- 
mal age level at which, presumably, students have gathered 
enough linguistic experience to handle their complexity. 

Suppose that we found, along with this hypothetical de- 
velopmental effect, that we could overcome the deleterious effect 
of certain text structures on younger students by offering them 
direct instruction in dealing with those structures. We would ad- 
vise teachers that if they are going to present young students with 
text embodying those structures, they will have to teach students 
how to handle them. 

Findings like the hypothetical cases described above, 
while not quite so dramatic as our make-believe examples, have 
emerged from time to time over the past twenty or thirty years, 
sometimes but not always accompanied by recommendations 
like those we have suggested. That text structure influences com- 
prehension, therefore, is not really at issue: what is at issue is the 
precise way in which the influence is exerted, why the influence 
exists, and what the influences have to say about practical mat- 
ters of teaching and writing instructional materials. 
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What Counts as Text Structure 

Before we can review the literature on the influence of text 
structure, we need to define what we mean by variations in text 
structure. We will approach our definition through examples. 
Consider examples (1-4) below. 

1. The lad rode the steed to victory. 

2. The steed was ridden to victory by the lad. 

3. The young man rode the horse to a first place finish. 

4. The horse was ridden to a first place finish by the 
young man. 

Each of the four sentences exhibits a different surface form for 
the same underlying idea, yet there are only two different 
grammatical structures represented, the active and the passive. 
Hence the difference between (1) and (2) or between (3) and (4) 
can be regarded as a variation in text structure. However the 
differences between ( 1 ) and (3) or (2) and (4) are better character- 
ized as lexical, or possibly semantic, variations. In the language 
of tranformational generative grammar, we would say that (1) 
and (2) have the identical deep structure (underlying meaning), as 
do (3) and (4). ' By intuition we would probably agree that there is 
only a slight difference in the two deep structures attributable to 
co-^notative differences in the meanings of lad-young man, vic- 
tory-first place finish, and steed-horse. But the basic point is that 
we will regard alterations in the grammatical structure of sen- 
tences, which do not alter any semantic meanings or relations, as 
examples of text structure manipulations. 
Now consider examples (5-8). 

5. Henry lost the quarterback job because his arm gave 

out. 

•Many theorists would argue that the underlying meaning of the active/ passive pairs is 
not idtintical. Clark and Clark (1977) using a topic-comment formulation., would argue 
that the focus differs from active to passive: In 1). what is emphasized is some new 
information about t he lad. i.e.. t hat he rode the steed to victory, whereas in 2) the emphasis 
centers on new information about the horse, i.e.. that it was ridden to victory by the lad. 
We concur. If we adopt a strict equivalencc-in-mcaning criterion, then the concept of 
paraphrase (multiple surface structure representations for a single deep structure) cannot 
exist. Nonetheless, we would prohably find that 95 percent of the population would agr.': 
that the same basic notions are being communicated in active/ pjissivc pairs. 
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6. Because his arm gave out, Henry lost the quarterbaek 

job. 

7. Henry lost the quarterback job. His arm gave out. 

8. Henry's arm gave out. He lost the quarterback job. 

The difTerenee between (5) and (6) is like the difference between 
(1) and (2) above - simple grammatical transformation, in this 
case preposing a subordinate clause. The difference between (5) 
and (7) is not as simple. It is debatable whether or hot the causal 
relation between the two clauses is preserved in (7), unless we 
resort to the Gricean principle of cooperation between author 
and reader (Grice, 1975) which posits that no author would 
arrange the two sentences in (7) adjacent to one another unless he 
was inviting the reader to infer that the one explained the other. 
Notice that the invitation to make the causal inference is even 
stronger in (8), presumably because of the covariation between 
causal and sequential ordering. Whether variations like those 
between (5) and (7) qualify as grammatical variation is not clear. 
But to us they definitely qualify as variation in some aspect of text 
structure. In certain systems of text analysis they would be 
regarded as alterations in the rhetorical structure (e,g., Meyer, 
1975), cohesion structure (Halliday & Hasan, 1976), or logical 
structure (e.g., F-rederiksen, 1975) of the diSv;Gursc, Such vari- 
ations are abundant in naturally occurring discourse, as' exem- 
plified in (9-11)." 

^9a. If you want to bea Badger, thencomealongwithme, 

*9b. Do you want to be a Badger? Come along with me. 

10a. After Matthew ate lasagne, he bought a new TV. 

10b. Matthew ate lasagne. (Then) he bought a new TV. 

1 la. Although Susan ran as fast as she could, she lost the 

race. 

1 lb. Susan ran as fast as she could. But she lost the race. 

1 Ic. Susan ran as fast asshecould. Alas! She lost the race. 

Moving from smaller to larger units of discourse, other 
kinds of structural variation enter the picture. For example, the 
structural difference between (12) and (13) has been character- 
ized as a staging variation (Grimes, 1975). 
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12. Robins build nests in Irccs. FMicasants build nests in 
bushes. Eagles build nests in r<H:ks. Birds build nests in a variety 
of places. 

13. Birds build nests in a variety of places. Robins build 
nests in trees. Pheasants build nests in bushes. Eiiglcs buiid nests 
in rocks. 

In this case, the meaning of the two texts is similar if not identical; 
however the position of the rule and its examples is reversed. 
Notice that this type of variation isa paragraph analogueof posi- 
tional transformations (active/ passive or preposing classes) at 
the sentence level. 

Thus far we have considered variations in text structure 
that have only minor influence on the semantic meaning of a text. 
Furthermore, the structural variations considered occur between 
or among rather small units of discourse -sentence components 
or sentences. Such analyses can be regarded as examinations of 
the microstructure of text. 

Other, and particularly more recent, conceptuali/ations 
of text structure have ignored the perspective of examiningstruc- 
tural variations that preserve meaning in favor of a perspective 
that examines the hierarchical aspects of text structure. Such 
schemes usually begin with a parsing of an entire text using either 
a ease grammar (e.g., Meyer, 1975) or a propositional (e.g., 
Frcderiksen, 1975; Kintsch, 1974) scheme to identify relations 
within and between sentences. Then the entire text is analy/.ed 
into a hierarchical structure, ideas (usually in the form of prop- 
ositions — basically a clause with an active or stative verb) are 
scaled according to their structural importance within the hier- 
archy. For expository texts, importance translates roughly into 
how ''main" or superordinate the idea is. For narrative texts, 
importance means centrality to the story. Thus, characters, goals 
and settings are high in the hierarchy while particular episodes or 
motivations may be fairly low. 

Implicit if not explicit in such analyses are two expec- 
tations: first, that height in the hierarchy will somehow predict 
and/ or explain the comprehensibility or memorability of par- 
ticular text segments, and second that surface structures that 
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violate canonical structure will decimate comprehension and re- 
call. Such schemes can be regarded as examinations of the 
macrostructure of text. 

With these two aspects of text structure microstructure 
and macrostructure we have defined the scope' of our inves- 
tigation. Our next step is to examine the empirical studies that 
have been conducted to evaluate the importance of text structure 
in comprehension. 

Microstructure 1: The Primacy of the Sentence 

To psychologists academically reared in the verbal learn- 
ing tradition of the forties and fifties, the revolution incited by 
Noam Chomsky's penetrating reviews of behavioral views of 
language processing { 1959) and his alternative views proposed in 
Syfuacdr structures (1957) and Aspects of the theory of syntax 
(1965), must have seemed a bold departure from the conventional 
wisdom. 'I'he very notion that one could .study units of discourse 
as large and complex as a sentence was revolutionary to re- 
searchers more comfortable with lists of nonsense trigrams or 
quingrams or associations among single words. 

Nonetheless, Chomsky's views widened research po.ssibil- 
ities for students of verbal behavior. Beginning whh the work of 
Miller and his associates (e.g.. Miller, 1962; Miller Isard. 
1963) several researchers in the mid-sixties conducted studies 
of sentence comprehension. T he most common finding among 
such studies (e.g.. Gough. 1965; Siobin, 1966) was tluit the 
transformational distance between the underlying meaning 
of a sentence (its deep structure) and its phonetic realization 
in speech or graphic realization in writing (its surface 
structure) was an accurate predictor of the speed or difficulty 
subjects experience in processing the sentence. In other words, 
performance could be predicted by variation in the grammar 
itself. Hence kernel sentences (simple active declaratives) were 
understood more rapidly than passives, interrogatives, or 
negatives, which presumably required more cognitive energy to 
process because more transformations had to be traversed in 
traveling from surface to deep structure. Findings such as 
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these led lo a flerivational theory of complexity, i.e., that the 
derivation of. a sentence s' surface structure from its underlying 
deep structure predicted its processing dilTiculty. 

Such studies were appealing to' reading educators con- 
cerned specifically with reading comprehension. First, they 
provided needed methodological tools/Finally, there was a way 
of qperationalizing sentence complexity. The notion of a 
transformation provided a countable indc; of complexity. In 
fact. Pagan (1971) and Pearson (1974-75>- used a derivational 
theor\' of complexity to generate and scale materials used to 
assess children's comprehension of sentence structural 

■ Second, these studies corroborated what was known from 
(or at least implied by) the thirty year old history of readability 
research: that long complex sentences were associated with 
passages that rated high in readability and low in comprehensi- 

bility. ^ ' - . 

Unfortunately, the derivational theory of complexity 
lived only a short life. It was attacked on two different fronts, 
both as a linguistic theory and as a psychological- theory. . 

The work of Fillmore (1968) on case grammar and gen- 
erative semanticist framework, of linguists like Lakoff (1971) 
called into question the transformationalists' preoccupation with 
syntactic relations at the expense of semantic relations. 

In psychology, studies such as thos^^' conducted by 
Bransford and Franks ( 197 1 ) and Sachs ( 19 '^-^^ ^ fered data con- 
tradictory to the derivational theory of complexity. Implicit in 
the theory, is an assumption that comprehension occurs by 
analyzing a sentence into its basic constituents (that is how you 
get from surface to deep structure). Bransford and Franks' 
evidence suggested that comprehension was bet'ter characterized 
by synthesizing constituents- into some semantically integrated 
chunk. Sachs* data indicated that memory for any aspects of 
sentence structure faded; quite rapidly, while memory for the 

:An almost incidcnial bui nonetheless imporianl methodological tool was the question 
translormalion. Bomuilh ri9f>^ m'l 1971), Pears n (1974-1975), and Bormitth. 
Manning Carr. and Pearsc.n (1971) and many res^.irchers since then have used the 
question transformation as an objective levicc lor v-nierating literal comprehension 
question prohes from text. 
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semantic ''gist'' of a sentence was remarkably stable. Working 
with children, Pearscur^fl 974-75) obtained results corroborating 
with work of Bransford and Franks and Sachs. 

In some ways, however, the issue was soon to become a 
moot point because somewhere in the early seventies researchers 
turned their attention a\vay from the sentence as a unit of 
linguistic analysis in Tavor of stories, passages, and expositions™ 
with a concomitant emphasis on macrostructure rather than 
microstructure. Later we will examine that line of research: first, 
however, we must add two pieces to the microstructure puz/lc. 



Microstructure 2; Linguistic Connectives 

The small but interesting body of research dealing with 
linguistic connectives speaks incidentally to issues of structural 
variation. This stems from the fact that when a connective is used 
in a sentence, it often has the effect of increasing the grammatical 
corhplexity of the sentence; connectives are involved in ihe 
formation of compound sentences and subordinate clauses such 
as those beginning with because, ahhough. he/brc, etc. In a sense, 
an examination of linguistic connectives is a sensible bridge from 
the studies lookin;; at the sentence as a unit of analysis to those 
(in a later section) which emphasize the larger organizational 
patterns of text. Linguistic connectives usually establish or cue 
logical relations among propositions or sentences. 

Walmsley (1977) defines linguistic connectives as follows: 

/\ linguistic conneciivc {or logical or hinguagc connective the terms 
appear to he ascd according to the oriei^'ation of a writer's discipline) 
may be defined as a**co-ordinating.qualiiymgorad\'crbiaIconjunction 
used to link i simple proposition with another idea (either a proposition 
or a concepO to form a complex proposition." Allcrnati\ ely. it may be 
det'ined as i syntactic strnclure signalling underlying logico-semantic 
relations (f^,eeOkls. 1^68). Connectives may link propositions within or 
between syfntenccs; ihey rnay comprise a single word (e.g.. aru/), or a 
phrase (e/g.. in ndclifion to), (p. 319) 

Some/ researchers have examined the developmental 
changes tha/ occur across ages in children's understanding of 
cCiinectivc/and the relations between the propositions they link 
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(e.g., Beilin 8i Lust, 1975; Neimark, 1970; Neimark & Slotmick, 
1970; Paris, 1973). Not surprisingly, children's understanding 
improves with age; however, the research in this tradition, 
because of the nature of the task ard isolated (not contextually 
embedded) stimuli, offers Httle a:;\:ce concerning their role in 
reading larger units of discours e, 

Robertson (1968) conducted one of the few educationally 
oriented studies. Her examination of basal readers used in the 
intermediate grades revealed that about one in three sentences in 
her sample employed some sort of connective. Student compre- 
hension of connectives increased from grade four through grade 
six and was related to listening and reading ability. 

Katz and Brent (1968) found that both first and sixth 
grade children preferred descriptions of causal relations that 
were made explicit by the use of a connective. This is consistent 
with the findings of Pearson (1974-1975) who reported that 
fourth grade students, given a choice as to the surface form in 
which a causal relation could be stated, preferred to have the 
relation stated in a grammatically complex subordinated form 
which included specific cues (because, so) denoting causal 
relationships. Pearson speculated that connectives and com- 
plexity (they go together) provided "... a more m;///V<:/ concep- 
tion of the causal relation** (Pearson, p. 174) and that it is the 
function of connectives to make the causal relationship more 
explicit. The.se speculations were strengthened by the findings of 
a follow-up .study in which students were asked to read individual 
sentences in which a causal relationship was either made explicit 
by inclusion of a causal connective or left implicit by omittingthe 
connective. Results showed that in almost two-thirds of the 
cases in which subjects were asked to read sentences containing 
an unmarked (i.e., implicit) causal relationship, a connective w as 
mcluded in recall, thus unifying the relationship and making it 
explicit. Furthermore, if a sentence was not recalled in a cued, 
unified form, there was a 50 percent chance that it would not be 
recalled at alL These findings suggest that connectives have a 
strong effect on the salience of causal relationships expressed in 
sentences and may serve to facilitate the integration of ideas in 
memory. 
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Finally, Marshall and Glock (1978-1979) found that 
explicitly stated logical (i.e., causal and relational) structures 
facilitated the recall of propositional content or discourse for 
^/^not-so-fluent" (community college) readers. Recall for "truly- 
fluent" (college) -readers for the same passages Was more 
complete than for community college students and was not 
affected by the presence or absence of explicitly stated relation- 
ships. Structure of recall for good readers reflected a greater 
degree of differentiation among elements of the underlying 
structure of the text than did the recall of poorer readers who 
focused primarily on content. Marshall suggests that these 
differences are due to the fact that good readers have more welf 
established schemata that can be used to interpret and store'tfie 
meaning of discourse whereas poorer readers have less complete 
structure and, therefore, must depend to a greater extent on 
information explicitly encoded in the surface structure of text. 

Microstructurc 3: Sentence-Combining 

Perhaps the most obvious attempt to determine the 
influence of direct instruction in the microstructurc of text on 
comprehension has been in the tradition of sentence-combining 
instruction. Beginning with the observation that attempts to 
teach formal nrammar have little positive effect on students' 
writing ability (Braddock, Lloyd-Jones, & Schoer, 1963; Mellon, 
1969), researchers originally looked to sentence-combining as a 
way of influencing syntactic maturity in writing (Con«bs, 1975; 
Mellon, -1969; CrHare, 1973). More recently, however, research- 
ers have attempted to determine the' effects of sentence- 
combining training on reading comprehension (Combs, 1975: 
Fisher, 1973: Hughes, 1975: Hunt & O'Donnell. 1970: Straw, 
1979). 

Basically, sentence-combining activities require students 
to integrate into a single sentence information expressed in two 
or more sentences as in (14) and (15). 

14. The boy hit the ball. The boy was tall. The ball was 
small. He hit it through a window. 

15. The tall boy hit the small ball through a window. 
The rationale for believing that such instruction could alter 
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writers' syntactic maturity seems o'^' \ the reason for 
suspecting a concomitant influence reading. :omprehension 
stems from a view of the language irts th:it \ hat influences 
growth in one language capacity will innucncegi' wth inanother. 

While positive effects have beenicpnn . d on some limited 
measures of reading comprehension (Combs, 1975; Fisher, 1973; 
Hughes, 1975), only a study by Straw (1979) has looked at the 
effects of sentence-combining as they transfer to listening, 
reading, and writing. Straw found that a sentence-combining 
training condition affected growth in all three language 
capacities. However, growth in reading comprehension was 
limited to an investigator constructed cloze test; it did not affect 
growth on a st ndardized test. Interestingly, a complementary 
sentence-reduction task affected growth in reading comprehen- 
sion (to a lesser degree than did sentence-combining) but not 
growth on the writing and listening measures. For purposes of 
our review. Straw's effects are noteworthy even though the 
treatment etTects do not transfer to a standardized test. There is 
no good reason to believe that a typical standardized test will be 
sensitive to such instructional treatments. His results do suggest 
that attention to microstructure, specifically allowing students to 
actively ..manipulate it, pays at least short range dividends in 
comprehension growth. 

These results seem compatible with those in the review of 
linguistic 'connectives. Note that linguistic connectives often 
serve the function of combining ideas that could be expressed in 
separate sentences. Ironically, then, these two areas of research 
suggest, in contrast to the earlier work in transformational 
grammar/that attention to cohesion rather than atomization of 
sentence elemi^nts pays greater dividends, 

Macrostructure 1: Narratives 

The main purpose of this section is to review research 
evaluating the intluence of the overall structure of narratives on 
students' comprehension and recall of information presented in 
texts. Several writers (e.g., Mandler & Johnson, 1977; Rumel- 
hart, 1975; Stein & Glenn, 1977; Thorndyke, 1977) have 
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developed formalisms for analyzing the relations among 
propositions in stories. Propositions can be related in two ways: 
by their relative position within the hierarchy of a story and by 
their rhetorical function. 

Like phrase structure grammars applied to sentences (e.g., 
Chomsky, 1957, 1965). in which rewrite rules dictated a 
sentence's decomposition (e.g.. Sentence =>Noun Phrase + 
Verb Phrase, Noun Phrase =>Determincr + Noun + [Sentence], 
Verb Phrase =>Vcrb + [Sentence]), so story schemata or story 
grammars specify a set of rewrite rules for decomposing the 
relations among propositions in a story. Thus a story can be 
rewritten as STOUV ==>snTTiNc; + tiIhnu-; + vlot + rhsoujtion; 
setting can be rewritten as sHTTiNG = >rnAKA(TiiKS + loca tion 
+ TiMiv, etc. When all the rewrite rules have been applied to a 
story, what results is an inverted tree diagram for a story, which 
looks quite similar to a phrase structure parsing of a sentence, 
except that the basic units arc sentences or propositions rather 
than words. 

In essence, this tree structure creates a hierarchy. Wtfat 
appear at the top of the hierarchy are the setting of the story 
(including characters, location, etc.), the basic theme, a few of the 
key episodes in the plot, and a resolution of the problem that 
motivated the characters to whatever actions they undertook to 
begin with. At lower levels in the hierarchy will be subplots. For 
example, suppose a character needed to get a car to drive to a 
beach so that he could dig clams for an important dinner. The 
activities in the story that were associated with getting the car 
would appear lower in the hierarchy because they were 
instrumental \n allowirm i\ higher level event (getting to the 
beach) to occur. Further suppose that in order to rent a car, the 
character had to phone several friends to borrow money. Those 
events would appear at an even lower level. Such hierarchical 
relations exist among propositions throughout the story; often 
the implied link between a higher and an immediately lower level 
event is causation or enahlement (a very weak sister to 
causation A allowed or enabled B to occur but did not really 
compel B to occur). 
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In addition, some grammars have established intracale- 
gory connectors to allow for explicit logical connection between 
events or states at the same level in the hierarchy. Stein and Glenn 
(1977), for example; include and, tiikn, and cause links. Hence 
rhetorical or logical connection between events and states is 
carried in two ways: vertically by implied hierarchical relations 
and horizontally by explicit links in the grammar. 

Story grammarians have postulated two possible conse- 
quences of story grammars. Assuming that students internalize, 
through constant exposure to storica of various degrees of well- 
formedness, something like a schema for stories, then compre- 
hension and recall of stories ought td'bc influenced by two kinds 
of variation. First, information in higher level nodes ought to be 
recalled more frequently than that in lower level nodes because of 
greater centrality to the basic actions and motivations oi the 
characters. Second, violations in the well-formedness of stories 
(e.g., the degree to which the order of key events is reversed or 
scrambled, placing motivations out of synchronization with 
actions, placing setting information at the end of a story, etc.) 
ought to decrease comprehension and recall. 

The first of these predictions has been emphasized by 
Rumelhart (1975, 1977). He has established a set of story 
summarization rules to predict the probability that a proposition 
will be recalled; basically a proposition is predicted to be recalled 
if a proposition lower in the hierarchy was recalled. He found 
that the conditional probability that a proposition would be 
recalled given that it was predicted to be recalled was .95. 
Rumelhart also interprets the data from the work oif Thorndyke 
(1977) and Meyer (1975) as supporting his hierarchical 
hypothesis. 

Other researchers (e.g., Mandlcr & Johnson, 1977; Stein 
& Nezworski, 1978; Thorndyke, 1977) have emphasized 
the effects in violation of canonical story .structure. Thorn- 
dyke-(1977) found that story recall was debilitated increasingly 
by a) moving the theme or goal to the end of the story, b) 
removing the theme altogether, and c) more or less randomly 
permuting the sentences in the story. Kintsch, Mandel, and 
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Kozminsky (1977) asked college students to read well- and ill- 
formed (scrambled paragraphs) stories in time-limitcd or 
unlimited conditions. Then the students wrote summaries of the 
1400 word stories. In the unlimited reading time c ondition, there 
was a 23 percent increase in reading time due to scrambling hut 
no differences in' writing time, length, or quality of story 
summaries. However in the limited time condition, better 
summaries were written for well-formed stories. Kintsch et al. felt 
that subjects in the unlimited time condition imposed a story 
structure on the scrambled text at the point of comprehension 
rather than simply at the point of summarization; hence the 
difference in reading but not summarizing time. Without time to 
restructure the ill-formed story, comprehension suffered, result- 
ing in inferior summaries. Stein and Nezworski (1978) found 
results similar to those ofl*horndyke(l977). Well-formed stories 
elicited better story recall than stories containing slightly 
disordered, randomly ordered, or unrelated statcinents. Further- 
more, unrelated statements elicited the greatest number of 
inferences into recall, reflecting subjects' attempts to make sense 
out of an incoherent text, a finding reminiscent of Bartlett's 
(1932) early results on cross-cultural intrusions into story recall. 

The developmental (cross-age) data collected by Stein and 
Glenn (1977) and Mandler and Johnson (1977) also support the 
notion of story schemata. As children grow older they tend to 
recall increasingly more of the lower level information in the 
story. Young children tend to recall only a few of the higher level 
propositions such as a character, an initiating event, and an 
outcome. 

While story schemata have been criticized for their cm- 
pha.sis on prediction rather than explanation and the fact that 
they predict too many behaviors (Thorndyke & Yecovich, in 
press), their basic validity as formalizations of what people learn 
when they learn about how writers put stories together seems to 
us to be well-founded. 

Imtnictional Research on Story Schemata 

♦ We were able to locate only three studies dealing even 
tangentially with issues of direct instruction about how stories 
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are structured. Bower (1976) had subjects read a biography about 
a fictitious poet. Then half the subjects read two biographies \Vith 
similar macrostructures while half read two unrelated texts. 
When they were- asked to recall the original biography, 
experimental subjects (the three biographies) recalled more of the 
macrostructure (which was similar in all three) but interconfused 
details of the second and third with the first. The similarity of the 
three passages created macrostructure facilitation and detail 
interference. Thorndyke (1977) found that subjects who read a 
second story with the same structure as, but different characters 
from, the first story recalled more second story information than 
those whose second stories had the same characters as, but a 
different structure from, the first. 

Neither of these studies can be considered instructional in 
anything but an incidental sense. However, a study by Gordon 
(1980) speaks directly to story structure instruction. Over a 
period of eight weeks she trained fifth grade students to apply a 
simplified story schema to basal reader stories that they read as a 
part of their normal reading instruction. On a transfer story, 
these students recalled significantly more, parficularly of certain 
categories of high-level information, than a placebo or an 
untreated control group. She interpreted the findings as 
supporting the notion that direct instruction in story schemata 
provides students with a transferable framework for storing and 
retrieving textually presented information. 

Macrostructure 2: Exposition 

Research and theory about the macrostructure \f 
expository text is not quite so abundant as that for narrative. 
Attempts have been made by Kintsch(1974), Frederiksen(1975), 
arid Meyer (1975) to develop general schemes for representing 
relations among units of text. Kintsch and Frederiksen give more 
emphasis to a scheme that could serve as either a model of text 
structure or the structure of knowledge in memory; Meyer's 
system is, admittedly, more concerned with representing text per 
se. Because of space limitations and because it places greater 
emphasis on text macrostructure, we have chosen to concentrate 
on Meyer's system, recognizing full well that we can justify our 
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decision only by asserting that we intend our review to represent 
an example from a class of text structure schemes. 

Adhering closely to the theory of Grimes (1975) for 
connected discourse and to Fillmore's (1968) case grammar, 
Meyer has developed a text structure system that emphasizes 
relations among propositions in a text. She has lexical proposi- 
tions that show the case relations between words within simple 
sentences and clauses. And she has rhetorical propositions which 
establish the relations between and among sentences^ paragraphs, 
and longer units of text. Rhetorical predicates are labels used to 
specify the relationships within these propositions. Rhetorical 
predicates order the ideas in a text into hierarchical relationships, 
and they allow Meyer to develop a richer, higher-level organiza- 
tion than either Kintsch or Frederiksen, 

Meyer's parsing of a passage looks much like an outline of 
the passage, except that all the ideas from the passage are 
included. Top-level discourse structures in the outline arc simply 
the relations that occur in the top third of the diagram. Height in 
the system is indicated by "leftness'' of a proposition in the 
content outline. 

A basic thesis of Meyer's is that height in the hierarchy 
predicts how well propositions will be comprehended and 
recalled. She designed an experiment in which a target paragraph 
was embedded high within the hierarchy of passage I but low in 
passage 2. The serial position of the paragraph was identical 
across passages. While she found no overall recall differences 
between" the two passages, the target paragraph was recalled 
better when it was staged higher in the hierarchy. These 
immediate recall differences increased with a week's delay. 
Similar differences were noted in the cued recall of the target 
passage after a week's interval. 

Meyer (1977a, 1977b) extended her research to determine 
whether or not sixth grade students were sensitive to these 
hierarchical differences in content structure. Meyer predicted 
that students classified as low in ability would recall more 
information from low levels of the content structure than from 
high levels in the content structure. 
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Immediately after listening to a short article about 
parakeets, students answered fiftten main idea and fifteen detail 
questions about the article. The main idea questions were derived 
from idea units high in the structure of the passage, and the detail 
questions were derived from idea units low in the structure of the 
passage. Results indicated that all of the students, regardless of 
ability level, answered more main idea questions than detail 
questions. Brighter students remembered significantly more 
information from both levels of the structure than other students, 
but even low-ability students answered more main idea questions 
correctly than detail questions. Meyer concluded that children, 
like adults, remember more information from high levels of the 
content structure of a text and that a content structure 
representation can be useful in generating different types of 
comprehension questions for prose materials. Meyer cautions, 
however, that the results of this study might not generalize to 
low-ability students with reading or learning disabilities under 
reading versus listening conditions. 

In order to explore the effects of different types of top- 
level discourse structures on recall, Meyer and Freedle(1979) had 
graduate students read articles with identical middle- and low- 
level structures and content. The passages differed in the way 
similar introductory information -vas organized in the top-level 
of their content structure diagrams. The four types of structures 
(rhetorical predicates she calls them) compared in the study were: 
adversative (conirastive pattern), covariance (causes-effect pat- 
tern), response (problem-solution pattern), and attributive (list- 
like pattern). The investigators predicted that inforriation in 
passages organized with adversative, covariance, and response 
structures would be remembered better than information from 
the passage organized with an attributive structure. 

Subjects participating in the study were graduate students 
working on advanced degrees in education. They were divided 
into four groups, and each group listened to a passage organized 
by one of the four rhetorical predicates. An immediate free-recall 
test, a delayed free-recall test, and a delayed short-answer test 
were administered to all subjects. The short-answer test consisted 
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of questio,ns,.yyhich tapped memory for information that was 
identtcai in each of the four passage conditions. Recall protocols 
were scored for the number of idea units recalled and for the type 
of rhetorical structure subjects used to organize their recall 
p;rotocols. The short-answer test v'-a ^itn;*')' scored for the 
number of correct answers. 

Subjects who listened to pas.sages organized with ad- 
versative fconstrastive) and covariance (cause-effect) structures 
remembered significantly more information than subjects who 
listened to passages organized with attribution (list-like) and 
response (problem-solution) structures. Moreover, subjects who 
listened id the adversative passage answered significantly more of 
the short-answer questions correctly than subjects who listened 
to the other passages. Subjects who listened to passages with 
adversative and covariance structures also used these types of 
relationships to organi/.e their recall protocols. 

From these findings, Meyer and Freedle concluded that 
differences in the type of structure used to organize textual 
information significantly affected the amount of information 
graduate students learned and remembered. Adversative and 
covariance organizations enhanced recall over attribution and 
response organizations. 

Using a schema theory orientatioti, Meyer and Freedle 
had predicted that adversative, covariance, and response 
structures would provide better organization for learning than an 
attributive, list-like structure. Each of the four types of structure 
is used in expository texts to let readers know inforrnation will be 
presented about a topic: but adversative, covariance, and 
response structures ostensibly provide readers with additional 
schemata to help them understand and remember the informa- 
tion. For example, an adversative structure indicates that the 
information will be about opposing views; covariance structures 
indicate that the information will be about causal relations; and 
response structures indicate that the information will be about 
problems and solutions. Attributive structures are more loosely 
organized, however, and do not provide additional information. 
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The prediction that adversative and covariance structures 
would facilitate recall was supported. The prediction about the 
response structure, however, was not confirmed. Meyer and 
Freedle explain this unexpected finding in terms of social- 
psychological facts and notions of perspective. The subjects 
participating in the study were school teachers who may have 
been offended by the solution in the response passage as it 
involved firing coaches. Thus, the teachers seemed to reject the 
schema provided by the author, read the text from their own 
perspective or personal viewpoint, and thereby processed the text 
differently than was expected. 

The most important finding in this study was that certain 
types of top-level discourse structures did facilitate recall more 
than others. Meyer and Freedle interpret the results of this study 
as showing that the most efficient strategy students can adopt in 
typical school-learning or lab-learning situations is to identify 
and use the author's organizational framework to guide and 
structure their attempts to understand and remember informa- 
tion from textual materials. Students who are familiar with the 
way texts are typically organized can use that knowledge to 
comprehend and remember by relating the organizational 
structure, or schema, of the text to their prior knowledge (stored 
schemata) about how texts are organized and what to expect 
from texts orgatiized in certain ways. 

Meyer, Brandt, and Bluth (1978) investigated the effects 
of identifying and using the organisational structure of texts on 
recall. They predicted that readers who adopted the strategy of 
identifying the author's organizational structure (the author's 
schema) would be able to recall more information than students 
who did not adopt this strategy. Ninth grade students classified as 
good, average, poor, and "difference" (high vocabulary but low 
comprehension scores) readers participated in the study. They 
read and recalled two different expository passages. One passage 
was organized with a response predicate and the other witH an 
adversative predicate. Thus, the passages differed in their top- 
level rhetorical structures and, also, in whether or not signaling 
devices were present in the texts. 
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Signaling devices, as defined by Meyer ( 1975), are ways in 
which authors emphasize aspects of the semantic content or 
structure of a text. The title ofthe passage and words such as*'in 
contrast to'' were types of signaling used in the adversative 
passage. An explicit statement of the problem and solution 
relations as well as signaling words such as ''fii-st," ''second," etc, 
were the types of signaling included in the response passage, 
Meyer et al. predicted that signaling devices would beneiit poor 
and ''difference'' readers in processing the texts as it was assumed 
that these readers did not normally use the organisational 
structure of texts to understand and remember information. 

Immediate and delayed free-recall tests were scored for 
the number of idea units recalled and the degree of similarity 
between the organization of the recall protocols and that of the 
original passages. Results indicated that good readers organized 
their protocols with the same structure as that used in the 
passages they read and that they recalled significantly more 
information than students who did not adopt this strategy. This 
result was obtained with good readers even when signaling 
devices were not present in the texts they read. The strategy of 
using the author's "schema" to organize recalled information was 
a better predictor of recall than either standardized comprehen- 
sion or vocabulary test scores; multiple regression analyses 
indicated that use of this strategy accounted for 44 percent ofthe 
variance in recall on the immediate-recall test and 68 percent of 
the variance in recall on the delayed test. 

Signaling appeared to facilitate recall of low and average 
comprehenders on the immediate test but not on the delayed- 
rccall test. On the immediate test, the students classified as poor, 
average, and "difference'* readers who read the response passage 
with signaling organized their recall protocols with the same 
pattern of relationships as those in the original passage; they also 
recalled significantly more information than similarly classified 
students who read the without-signaling version ofthe passage. 
However, signaling had no effect on the recall of students who 
read 'the without-signaling and adversative versions of the 
passage. 
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In a subsequent study, Bartlett (1978) taught a group of 
ninth grade students to identify various types of top-level 
structures comn-ion to expository texts and to use the structures 
to organize their recall protocols. The students were taught how 
to identify and to use covariance, ad\'crsative, attribution, and 
response structures during a week-long training period. Appar- 
ently, students trained to use the strategy of identifying an 
authofs top-level structure were able to recognize these 
structures in texts significantly better than students who did not 
receive training; and trained students were able to recall nearly 
twice as much information after reading than students who did not 
receive training. Thus, some evidence exists to suggest that 
students can be taught to identify top-level discourse structures 
and that such training improves comprehension. 

Implications for Reading Practice 

It is always somewhat dangerous to leap too boldly across 
the gap from research, especially basic research, to educational 
practice. A more cautious approach is to suggest that research 
findings from laboratory or other basic research settings should 
be. regarded as grist for applied research studies which should be 
carried out in real school environments before we make any 
conclusive recommendations for changes in materials or teaching 
strategies (e.g., Bronfenbrenner, 1976). Nonetheless, we see sev- 
eral areas in which the leap seems so reasonable and inviting that 
we make it, caution notwithstanding. 

Recommendations to People Who Prepare 
Reading Materials 

1. The research on children's comprehension of story 
structure suggests to us that from the outset of grade one, 
children ought to be reading stories that are highly predictable in 
terms of their conformity to canonical story schemata. We 
recognize that the need to control vocabulary in the earliest of 
stories makes it difHcuIt for w liters to create well-formed stories. 
Yet we are convinced that it is these young children who need th j 
predictability the most. Consider the case of a first grade stud... 
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who is trying to make sense out of the unfamiliar orthography of 
English writing. The child is already confronted with one source 
of potential confusion (figuring out what sounds the letters 
make); to embed that task in a context that can be another source 
of confusion (stories that violate story schemata) seems to 
compound the problem. 

2. The research on the influence ofconnectives, structures 
of cohesion, and sentence combining activities suggests that 
complexity may sometimes add to rather than always detract 
from the likeliliood that comprehension will occur. Cohesive 
forms of statements appear to make explicit what is otherwise left 
to children's inferential powers. Textbook writers need to be 
aware of this fact, /vbove all they should not be led to the false 
conclusion that writing becomes more readable when complex 
sentences are chopped in half (even though such a practice will 
reduce a passage's readability scores). 

Recommendations to Educators (and Writers of 
Instructional Practice Materials) 

3. The salutary on sentence-combining training has been 
r:T)licated several times. It seems reasonable to recommend that 
students be given an opportunity to learn an important fact about 
th:: English language; that there is always uiorc than one way to 
express a given idea. Awareness of this fact also apparently leads 
tr gro\^th in syntactic maturity and listening, particularly if the 
f/cus is on creaiing cohesive statements. 

4. If teachers want students to "get the author's message/' 
'^rx are well ad\ Ned to model for students how to figure out what 

the author's general framework or structure is and then allow 
students to practice discovering it on their own. They should be 
cautioned, however, that not all reading has as its purpose 
''getting the author's message"; sometimes students need to read 
to update their own knowledge, in which case they arc probably 
better off working within their own schemata rather than an 
author's schema (Spiro, 1977). 

We clo.sc this section with a disclaimer: These are not the 
oniy suggestions which could be derived from the research base 
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on comprehension of text structure, only those th. >cc led most 
reasonable to use. Also, these may not be the rr. oi^if important 
implications to be derived for practice from research; it may be 
that the research based on the structure of knowledge in memory 
or the interaction between text structure and knowledge struc- 
ture or the process of learning to monitor one's own comprehen- 
sion may prove more fruitful for instruction. They simply do not 
fall under the scope of this review. 

Implications for Instructional ^leseurch in Reading 

1. We need to know more about the point in time when 
children are able to handle certain complex kinds of syntactic 
structures. There was a time in the late sixties when the 
conventional wisdom concerning syntactic development seemed 
to suggest that, by the age of six, children had mastered nearly all 
the syntactic structures they would use as adults. Then the wqrk 
of C. Chomsky (1969), Bormuth, Manning, Carr, and Pearson 
( 1 97 1), Olds ( 1 968), and others pointed out that even by age ten 
children still had trouble within many structures. Somehow the 
rush toward semantic and macrostructural concerns in the mid- 
seventies buried what was an incomplete and fruitful line of 
research. We still need to finish the job. 

2. After the issue of development comes instructions: Are 
those structures which cause difficulty even for the ten-year-old 
amenable to direct instruction and systematic practice? 

3* The work of Meyer and her associates suggests that 
good readers. are better at following an author's rhetorical plan of 
organization than are poor readers. The next step is to 
demonstrate that poor readers who receive direct instruction in 
deciphering an author's organizational plan improve in their 
ability to produce greater veridical comprehension and recall of 
text. 

4. In this regard, we need to know more about the relative 
efficiency of different rhetorical plans of organization (e.g., 
adversative, covariance, attributional, etc.) in communicating 
content in various disciplines (science versus history versus 
geography). It may be that certain plans are uniquely suited to 
certain types of content. 
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5. i he work of Gordon (1980) should be extended to 
younger age levels to see if the salutory effects of story schema 
training wilT assist even younger students. In this regard, we 
should mention the exciting but emerging work of a group of 
researchers in Boston (Rubin, 1980) who are using a stoi y schema 
framework to help young children get (^ff to a faster start in 
writing as well as reading stories. 

6. Finally let us offer one general suggestion for 
instructional research derived from constructs emanating from 
basic research. When we look at research on teaching and 
learning variables, we have been awestruck by the persistence and 
ubiquity of two terms: engaged time on task' and direct 
instruction (e.g., Becker, 1977: Berliner. 1975: Rosenshine, 1976). 
We finally seem to be getting the message that kids learn what 

'icy arc taught and get to practice. Thus far. the l esearch seems 
to have shown these effects in more mundane aspects of reading 
such as word identification. But there is no reason to believe that 
they wouldn't aid comprehension as well, even though we have 
evidence that few teachers teach comprehension (Dnrkin. 1978- 
1979). fn fact, the, work of Straw (1978) and Gordon (1980) 
reported earlier, as well as a recent study by Hansen (1979) seem 
to provide direct evidence that students learn new strategies for 
comprehending text when they are taught and practiced 
systematically. 1 he point is simple; when we identify a variable, 
including a text striicturc variable, that looks like it might make a 
difference in comprehension, we ought to adopt a frontal assault 
strategy when considering its instructional power teach about it 
systematically and make certain students have a chance to 
practice it. The time for a renaissance of the niethodological 
study is now now that we have a better idea of what io look at. 

A final caution: we don't expect that the products of 
this new methodological research will be altogetfier new and 
surprising. In fact, we expect that many will elicit reactions of 
'Reinventing the wheel.'' or '*that\s just common sense." Such 
reactions will please us. Common sense is all too common and all 
too sensible to be overturned by a single line of research. But no 
real value in the new research \v\\\ be the contextual and 
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theoretical base from which it emanates. Hence we will be in a 
better position to answer the question, Why did it work? 
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On the Making of Inferences During Reading 
and Their Assessment 



Thomas IVabasso 
University of Chicago 

In this paper, I would like to indicate how the making of 
inferences plays a role in the comprehension of narratives. In so 
doing, seven questions are posed and answers to each question 
are discussed. We as! . first, for a definition of\vhai is meant by 
inferences. Then we explore what functions are perfornied by 
inferences. This is followed by a consideration oiwhat is required 
to make inferences and what processes are involved in making 
inferences. Next, we discuss what kinds of inferences there are 
and take up the relationship between the kinds of inferences and 
how one can assess a child's ability to make inferences. Finally, 
we end on the practical and educational question oi whether one 
can promote comprehension through the asking of inferential 
questions during reading. 

What Are Inferences? 

What does a reader do when he makes an inference? From 
our perspective, he docs one of two things: he either finds 
semantic and/or logical relations between propositions orevents 
Which are expressed, in the narrative or he fills in missing 
information which is necessary to making such connections 
between events. The Trst k ind of inferencing has been called "text 

The writing of this paper wiis supported by National Institute of Education grant Nn:-G- 
77-00 1« to T..Tra basso and. in addition, under contract >. us-NiE-c-400-76-0()16. 
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connecting" and the second, "slot-i'illing" (Warr , Nicholas, & 
Trabasso, 1979). 'I'hcsc descriptive terms for making inferences 
come from a recent theory of natural language understanding 
and memory (Schank, 1975). According to this view jhe process 
of understanding: largely one of translating a series of sentences 
into a causal chain of underlying eonceptuali/ations. Hach 
sentence consists of one or more conceptualizations which must 
be derived t/om the surface structure expressed in the text. 1*his 
process involves linguistic and world knowledge about individual 
word meanings and relations within a sentence. The process of 
creating the causal chain, however, involves inference generation: 
11ie reader is assumed to read a story to generate the causal chain 
and the memory representation, and to encode events that are 
explicit along with those that are inferred. This representation in 
memoT is then" used to perform a \'aricty of operations such as 
retelling or recalling the story, summarizing the story, detecting 
the main ideas, deciding which events occurred in which 
temporal order, ansv/ering probe questions as to causes, eonsc^ 
qucnces or facts, paraphrasing events, and giving different points 
of view of the narrative. 

Ail of the above activities may be recognized as either 
related to comprehension or susceptible to being captured in 
comprehension tasks (cf. f\^arson & Johnson, I97S). The 
important contribution of Schnnk (1975) is the stress on the 
initial understanding by the reader and on the question of what 
representation of the story is constructed as a result of this 
understanding at the lime of reading. If the reader (or listener) 
should fail to construct the relations between events, explicit or 
inferred, then the subsequent activities would not he possible, the 
reader having no memorial basis for performing them: Represen- 
tation results from and requires an initial understanding of 
sentences and their relations and, in turn, precedes all other 
forms of comprehension. 

The assumption^ here is that the representation or 
understanding of a story is essentially a chronology of alternating 
events and stales with causal links. This idea is hardly novel. In 
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fact, Dcwcy (1963) seemed to have had a similar notion in mind 
when, in 1933, he wrote on "meaning,'' 

To grasp the meaning ot a thing, an event or a situation is to sec it In its 
relations to other things; to note how it operates or functions, what 
consequences lollow iVona it; what causes it, what uses it can he put to 
(p, 135). 

In the above qu.»te, note the emphasis on relations to other 
things, notably causes and consequences, 

Since the making of inA:rcnccs is a highly automatic and 
largely uncon,scious process, it is necessary at the outset to use 
illustrations, both to demonstrate what is meant by an inference 
and to make it elear'that the making of an inference, while highly 
automatic, is not a simple or obvious process, Fhis should 
become clearer when we try to understand what the process is and 
how it is made to operate. 

Consider the following pair of sentences, taken from 
Bransford and MeCarrell (1975): 

1. John missed the bus, 

2. He knew he would have to walk to school. 

Note first that there is no explicit causal connection between (1) 
and (2). Therefore the reader, when confronted with this pair of 
sentences, would have to make assumptions about the connec- 
tions between (I) and (2) in order to understand them. If these 
sentences occurred in the order (I), then (2), the der might 
infer that (1) was the causal antecedent of (2) a. .i n ovide the 
connective ''so,'' "and then,'' "thus,'' or "as a result " v The fact 
that we automatically assume (1) to be the cause ot (2) becwmes 
more apparent when we try to interpret the following sentence: 
3. John n^'sscc! t?ic h-js hrcamc he knew he would have 
to walk to school 

In (3), the causc/c ns ;^ jnce relations of (1) and (2) are now 
reversed. Presumably, John wanted to walk to school ami so he 
may have deliberately missed the bus. 

Again, consider two more examples from Bransford and 
MeCarrell (1975): 

4. The mirror broke. 

5. The child grabbed the broom. 
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We automatically assume that cvlmU (4) is the temporal and 
causal antecedent of event (5) and we 111! in the relation as 
expressed by connectives sucli as '*so" or ''therefore." However, 
our assumptions about cause and effect are apparent when we 
encountcjr event (6) which is contrary to the assumed cause/ 
eonseqiio'nce relation. 

6xThc mirror broke /^rcrnoc the child grabbed the broom. 
V/iicn connectives or relations are not explicitly flagged by 
syntactic markers in text, then readers infer them based upon 
temporal sequence and causal knowledge of the world. When 
connectives are explicitly stated, they are used to guide 
assumpfions about causes or conset|uences in order to compre- 
hend what we read. It makes a great deal of difference in events 
(3). and (6) how we interpret .lohn\s or the child's motives and 
responsibilities for actions or conscquen-.vs. 

Nicholas and Trabasso ( I979)cite another example which 
we shall use to illustrate first what inferences are and which ones 
appear to be necessary to underscanding text. Then, in the next 
section, we shall use the example to illustrate functions of 
inferences. 

Suppose you heard the line: 
7. Mary had a little lamb. 

What do you think of? Nursery rhymes? Mother Goose? Uitlc 
girls? Fleecy frolicking lambs? Now, read event (7) in conjunction 
with each of the follov ing events and note how your 
interpretation shifts. 

8. Its fleece was white as snow. 

9. She spilled gnwy and mint jelly on her dress. 

10. The delivery was a difficult one and afterwards the \ ct 
needed a drink. 

-What assumptions appear necessary to understand event pairs 
(7) and (8), (7) and (9). and (7) and ( lOV' In event pair (7) and (8). 
we use our knowledge to infer that M: ry is a character from a 
well-known nursery rhyme a little girl v/ho is followed about by 
her pet iamb. The verb "had" alludes to ownership, and the 
animal is alive and well. 
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In (7) anil (9). the sheep has not larcci so well. Here 
"Mar\'" is probably human and I'cmale since the pronoun 'Njic'' 
and The noun "dress*' allow this uiferenee. "Mary" may also be a 
ehiki since children are more likely'l'o spill food on themselves. 
*['hc references to gravy and mint jelly indicate, however, that the 
lamb is actually a meal, not a pet. 

I'inally, in (7) and (lO). the reterenees in ihc \elerinarian 
and to a difiieult delivery suggest that Mary had given birth to a 
small lamb and is. herself, a mature, lemalc sheep. The \"et is 
prot)ably an adult, human being whose prof'/ssion is to tentl to 
sick animals. The drink is likely to be alc-oholic and is presumably 
taken to enable the vet to relax alter tiie diffienh dch\er\' of the 
newborn lamb. 

Note the \ast lange n\ a^sunifMions and k now ledge that 
is neces>,ar\' lo understand these pairs of e\'ents. We neetl lo know 
about nurscr\- rhymes, ownership, pets, little giris. sheep, food, 
animal births. \'eterinarians. and alcohol. This kiurvvledge is used 
to construct an interprctaticm of (7) in the light ol (S), (0) or (10). 
Note, also, that (7) is an inherently amhigutuis sentence and that 
events (Hi (9) or (If)) iinokc knowledge about three radically 
diffcrcnr ..;ontc.\ts in order to infer inl'ormation that is implicit in 
the message. The activation of the knowletlge contained in (S). (9) 
or ( 10) appeal s ■necessary in order to int-.-rpret (7) itt each ol its 
\'arious meanings. 

/ia( l-rnn'iion.s Do liilcrou cs /'cr/i^rni'/ 

Inferences perform a \ariet>' of 1 unciions and by 
indicating this di\crsit\' lhro\igh definitions and examples, we 
may more fully appreciate their complexuy. 

First, micnded meanings of indivichial WDrds are olien 
and^iguous and n^ist be arrived at infcrcntially. Tims, one 
function of inferences is to resoKe lexical ambiguity. In the above 
"MaiA'" sentences, for example, the word ".had'' ma\' be 
interp* respectively as: 

owned or possessed (e\ents 7 aiul S) 

ate (events 7 and 9) 

or izave birth to (events 7 and 10) 
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"Lamb" may -be interpreted, respectix ely, as: 
a living animal, (events 7 and 8) 
a prepared meal (events 7 and 9) ^ 
a newborn sheep (events 7 anjd 10) 

A second function of infei>>iices is to resolve nominal and 
pronominal references (anaphora). Again, in the abo\c exam- 

^•It.s' refers to the bmb and not Mary in (7) and (S 
"She" refers to Mai^v and not the lamb in (7) and (9). 
"Halving lamb" refersUb delivery or birth in (7) and (10). 

In order to interpret sentences while \ve read, we need to 
establish ^icontext. This context is also arrived at inferentially. in 
the above examples, three contexts or topics are inferred: 

nursery rhyme in (7) and (8) ' 

meal in (7) and (9) 

birth in (7) and (10) 

A related, fourth function is that inferences aid in 
establishing a larger framework for interpretation. We shall now 
present three sentences used by Collins, Brown, and Larkin (in 
press) to illustrate how we construct and reconstruct ''models" 
(frameworks) from given information. When one is processing 
the sentences given as data for constructing a Framework, the 
procedure is said to be. "bottom-up." Once thcf' model is 
constructed and is used' to interpret new information, the 
processiiig is. said to be "top-down." The initial step, upon 
reading sentence (II). is "bottom-up," but once the 'MiiodeP" is 
established, we*' use it in a "top-down" to guide further 
interpretation'. Some "models" are" inappropriate or ctmnOt 
accommodate the subsequent events and are, hence, abandoned. 
New "models" must be inferred. So, read and think about your 
""models" as yoif progress through evcnt.s (I 1), (12) and (13): 

11. He plunked down S5.00,at the window! 

12. She-tried to give him S2.5(n^Khe refused to take ii 

13. when they got inside, she bougTrt him a large bag c^f 
popcorn.. 

In studying (II). Collins et al. (in press) found that subjects 
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interpreted the window as that at a racetrack and the $5,00 a bet. 
Probably, the verb, ''plunking down," led to tbis interpretation 
since this term isjargon used by bettors lor the act oTmakingbets. 
However, this "model" undergoes reinterpretation in (12) since 
the attempt to give back $2.50 and its reaction arc incongruous 
with the aniounts normally bet at racetracks and with what 
appears to be the returning of change during a business 
transaction. Event ( 13) aids in constructing a new model, namely, 
going dutch on a date to the.movies. The Collins et al. (in press) 
examples illustrate what is meant by an interactive model (see 
Rumelhart, 1977 for a discussi6n of these kinds of models). The 
central point, though, in the examples is that inductive 
reasoning is initially involved in constructing the model. Once 
CO acted, the process becomes top-down. 

Once a model is.constructed it enables the prediction of a 
number of events, including probable pre-conditions, causes and 
consequences of actions, emotional reactions, goals, etc. Those 
predictions arc what guide the assimilation of new information 
into old and underlie the intense current interest in scliemata 
(Bartlett, ' 1932), frames (Minsky, 1975), story grammars 
(Mandler & Johnson, 1977; Rumelhart, 1975: Stein & Glenn, 
1979), scripts (Schank & Abelson, 1977) and other organized 
knowledge bases for comprehension, • 

In the ''Mary" examples, when one combines events (7) 
and (9), a precondition is that the meal was prepared; an 
inlbrence is that Mary was hungry and likes Iamb; and a 
prediction is that since her dress is soiled her mother may become 
angry and Mary might be punished, 

^Vluu Is Required to Make Inferences'/ 

It is clear that background knowledge is needed to make 
inferences. What the reader knows or has experienced prior to 
reading a is critical, and the reader's knowledge of the world 
or procedural knowledge may be decomposed into a number of 
knowledge domains. One implication of this is that if we want to 
enhance a child's comprehension of what he reads, wc would do 
well to increase his general knowledge and understanding as well 
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as teach him specific reading skills, after he has learned to decode. 
In addition, vocabulary (conceptualization) knowledge, regard- 
less of domain, is a crucial pre-condition to comprehension 
(Pearson & Johnson, 1978; Trabasso, in press) since without 
understanding the basic concepts contained in the text or 
question, one cannot make inferential links. 

Knowledge of text structure also helps comprehension. In 
narrative and expository text, this may aid in a top-down 
fashion. For example, since stories have well defined episodic 
structures (Mandler & Johnson, 1977: Stein & Glenn, 1979) the 
reader may establish expectations of settings, of events v/hich 
create goals for the protagonist, of plans for achieving goals, of 
actions, of consequences or goal realizations and of reactions by 
the protagonist. These structures also presuppose context and 
relationa!, as well as functional, knowledge of the grammatical 
categories. ' 

Knowledge about social interaction and human inten- 
tionality may aid comprehension. Stories entail considerable 
knowledge about social and personal interaction(Schantz, 1975) 
as well as about goals, plans, and actions (Schank & Abelson, 
1977). In short, they represent a kind of naive psychology based 
upon a theory of actions and motives behind actions. Children 
acquire and use these naive theories of human motivation and 
goals to understand narratives. The problem is to determine what 
they know at different levels of development and how this 
knowledge interacts with what they read. 

Finally, knowledge of causal relations between events is 
crucial for making inferences. The reader's ability to generate 
causes and consequences of events enables the prediction and 
assimilation of events into a causal chain representation as well as 
the filling in via inferences of missing information. With repeated 
exposure to situations, the reader develops stereotyped gener- 
alized experiences, called scripts (Schank & Abelson, 1977), 
which allow a well-constructed, known causal chain to predict 
behavior. Deviations from the script require further inferencing. 
When scripts arc not available, the reader uses "plans" to acquire 
information and construct new scripts. In short, the read'er's 
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knowledge base, ineluding his cultural baekground, appears to 
be the bottom line for eomprchcnsion. 

IVhal Processes Are Involved? , . 

In the above diseussion. reference was made to "top- 
down** and "bcntom-up'' processes interacting in making 
inferences. When '^top-down/' the construction of a causal chain, 
inferential prediction, and event integration is preceded bv 
organized knowledge structures. When these are absent, th- 
reader must use wc^rd recognition, word knowledge, and 
linguistic skills to derive seiiicncc meanings and infer a model or 
framewo rk\ 

Wow these processes are accufnplished is a mystery, 
although some computer models are a\"ailahle, such as those 
discussed in Schank and Abelson ( 1 977) or Kintseh and Van Dijk 

(1978) . These approaches both involve the linking of proposi- 
tions: In the Schank and Abclson\s system the concepts are 
underlying meaning.s of arguments in propositions and the links 
arc causal iti nature whereas, in tiie Kintseh ..iid Van Dijk 
approach, the linkages are determined by concept o\erlap or 
repetition across propositions. 

li'/ia! Kinds of Inferences Are T/iere'/ 

In this section, we shall brielly summari/e the kinds of 
inferences detailed in an inference taxonomy by Warren. 
Nichola.s. and Trabasso (1979). According to Warren ct ai. 

(1979) . inferences may be divided into those which are logical 
(causal), those which arc informational, those which arc spatial 
or/temporal, those which are related ta script knowledge, those 

.which d:pend upon world knowledge in some general sense, and 
those which are primarily evaluative in nature. Here, we shall 
stress the first class of inferences since wc regard, them as 
necessary to the construction of relations between events and the 
building of a causal chain representation. Informational 
inferences are thought to be more intrapropositional in nature, 
and while they are crucial to forming conceptualizations of 
sentence content, and precede the causal connecting of such 
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conceptualizations, the construction of a causal chain is more 
important to what we arc considering. 

Logical inferences can go cither in a forward (consequent) 
or backwa^rd (antecedent) manner. For example, if we know the 
soal of a protagonist, we can expect or predict certain actions to 
occur as consequences. On the other hand, knowing his actions 
constrains inferences about the reasons why he is doing what he 
does. Warren et al. (1979) distinguished among four types of 
logical relations. 

1. Moiivanonal. Goals motivate either other goals or 
such overt actions as events (goals also motivate cognitions 
[thoughts] and emotional reactions motivate goals and**cogni- 
tions). 

2. Psvchoioi^iral cauM'.s. Actions which are involuntary, 
:\s well as thoughts and feelings, are psychologically caused. 
Crying, inferring, and becoming angry are examples. 

3. Physical causey. Physical or natural events or physical 
actions cause (mechanically cause) changes in state. Bri\'iking a 
leg or drinking a glass of water are examples of act'oiv^ which 
physically cause a change in state. 

4. I'Mahlcmcni. Enablements are those conditions, typi- 
cally states, which are necessary but not sufficient for a state or an 
action to occur. Having money enables one to buy things. 

This listing of causal links resembles, in part, that ol 
Schank and Abelson ('1977). In their system, actions result in 
(phvsieally cause) states, states enable act.s, states or actions 
/////^/('(psyehologically cause) a mental state, and mental actions 
(wals, thought*^, cognitions) arc the reasons for (motivate) 
phyviical actions. In addition, one can have preventative causes 
where a state disables an action. 

The logical relations identified above determine the kind 
of inferences made. If one (ocuscs on an event and asks a why 
question about that event, then the kind o\ inference required is 
determined by the nature of the link. This docs not mean, 
however, that the kind of processes invoked differ. The same 
process of finding events related to other events may occur for all 
four types, in fact. Omrsnson. Warren, and Trabasso (197H). 
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Lisiiii^ r((/bc tests on children 5 to K years in a[>;:, failed to find 
consisienf differences among logical causes. 



How Can We A.ssess Whai Jnfercnnw a Reader Makes? 

In this section, we shall notdeal with the question of what 
inferences readers make during reading. At the moment, there 
are no adequate methods for assessing this (see Trabasso & 
Nicholas, 1980, for a review on inferences by children) and there 
is considerable debate about how many inferences are necessary 
for the construction of a representiition of events in a narrative 
(VV arren, Nicholas, & Pra basso, 1 979). f here! ore, a co nsi der- 
ation of the kinds of question thai could help in finding out iT the 
reader could make certain inferences is now what needs to be 
discussed. In this discussion, we shall rely heavily upon a recent 
book by l.ehnert (1978) on answering questions. 

In order to illustrate the question types and relate them to 
the inference types above, read the Farmer and the Donkey story 
in Table L 

If the reader generates inferences which result in the 
construction of a causal chain of events^then his ability to answer 
questions about logical relations either during or after the 
reading of a narrative should reflect this generative capability. 
Questions can be posed which assess the reader's knowledge of 
causal antecedents or causal consequents (Lehnert, 1978). The 
question itself contains a conceptualization, and the syntactic 
form of the question determines which kind of relation is being 
queried. 

Referring to the Farmer and the Donkey story in Table I , 
we can ask causal antecedent questions on inferences of the types 
previously described. Consider, first, the following variants of a 
physical causal antecedent question. 

14, Why did the dog begin to bark loudly? 

15. What caused the dog to begin to bark loudly? 

16, What happened that resulted in thedog's beginning to 
bark loudly? : 

17. The dog barked loudly because,,,? 
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Table 1 

The Farmer and the Donkey Story 



L There was once an old farmer 

2. who owned a very stubborn donkey. 

3. One evening ihe fiirmer was trying to put his dt>nkey into its shed. 

4. First, the farmer pulled the cJoti!(e>'. 

5. but the donkey wouldn't move. 

6. Then the farmer pushed'the donkey. 

7. but still the donkey wouldn't move. 

8. Finally, the farmer a.sked his dog 

9. to bark loudly at the donkey 

10. and thereby Irightcn him into the shed. 

1 1 . But the dog refused. 

12. So then, the faVmcr asked his cat 

13. to scratch the dog 

14. so the dog would bark loudly 

15. and thereby frighten the donkey into the shed. 

16. But the cat replied. 

17. *M would gladly scratch the dog 

18. if only you would get me some milk." 

19. So the farmer wer.t lo his eow 

20. and asked for some milk 

21 . to give to the cat. 

22. But the'cow replied. 

23. *'! would gladly give you some milk 

24. if only you would give me some hay.** 

25. Tlitis. the farmer went to the haystack 

26. and got some hay.- ^ 

27. As soon as he gave the hay to the cow. 

28. the cow gave the farmer some milk. 

29. Then the farmer went to the eat 

30. and gave the milk to the cat. 

31. As soon as thqicat got the milk. 

32. it began to scratch the dog. 

33. As soon as the cat scratched ihc dog. 

34. the dog began to bark loudly. 

35. The harking so frightened the donkey 

36. that it jiimped immediately into its shed. 

Although why questions signal a causal antecedent 
rw:t»Uon, examples (14-17) indicate what question as well as what 
verbs or connectives can mark their relations. Note also that in 
each example, the same conceptualization (the dog barking 
loudly) is indicated. The reader then mast search his memory for 
that conceptc^aliz^tion (here, the cat scratched the dog) which 
resulted in the dog\ being in pain (an inference) and his barking. 

An example'of a psychological antecedent causal question 
is given in (18). 
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18. Why did the barking frighten the donkey? 

If an event leads to another event, and wc pose qiicsiions 
about the first event, then we are askingfor answers which eall for 
consequential conceptualizations, 

19. What happened when the farmer gave ihe cat milk? 

20. What resulted from the farmer giving ihe cat milk? 

21. What happened after the farmer gave the cat milk? 
Examples (19-21) show variations on a causal consequence 
question concerning the goal* satisfaction of ihc eat as a 
preeondiiion for the cat scratching the dog. In general, 
consec":ence questions are signalled by what happens when . . ,? 

li is also possible to pose consequence questions 
ncgaiivcly to see if the reader understands c\*enis that would not 
have occurred if certain prc-condiiions weren't met or if certain 
anlcccdci't events had not occurred, in the eoniexi of a story, 
thcs'.: arc hypothetical non-events. For example, 

22. What if the farmer hadn't given the cat milk? 

23. What would have happened if the farmer hricin'i given 
the cat milk? 

24. ff the farmer hadn't given the cat milk, then what 
would have happened? 

We can direct the reader towards C(Mi ^ccjuences by providjp.g 
int'ormaiion as in 

25. What did the cat do after the farmer ga\'c the cat milk? 
Question (25) specifically directs the reader to the cat's action. 

Motivational questions (\yh'd{ l.chncrt, 1978, refers to as 
Cioal Orientation) may be posed as anieeedcni or consequent 
(purpose) questions. For example. ^ 

26. Why did the farmer ask the cat to scratch the dog? 
could be anwercd by an antecedent event, 

27. The dog refused to bark al the donkey, 
or by a purpose, ■ 

28. He wanted to gel the dog to bark in order to frighten 
the donkey and make him jump into the barn. 

The event described in (27) resulted in or motivated the 
farmer to ask the <"at to scratch the dog since it was a failure in his 
initial attempt al his superordinate goal of getting the donkey 
into the barn. That led to the farmer's subsequent actions. The 
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event in (28) is the reason for or purpose of the farmer's asking the 
cat to scratch the dog. 

Motivation questiotis may require answers mvolving 
more than one subgoal and a major goal. Examples (29), (30), 
and (31) contain questions on actions which could be answered 
by two, three, or four goals or motives, respectively. 

29. Why did the farmer ask the cat to scratch the dog? 
(two reasons) 

30. Why did the farmer ask the cow for milk? (three 
reasons) 

31. Whv did the farmer give hay to the cow? (tour reasons) 
The fourth logical relation, that of enablement, is usually 

marked by How or What and calls for answers involving states or 
action which satisfy specific pre-conditions necessary for the 
event in the question to occur. Examples (32-34) show some 
variations: 

32. How was the farmer able to get the cow hay? 

33. What did the farmer need to do in order to get the cow 



34. What did the farmer do in order to get the cow some 



hay'.' 

hay? ' . 

Enablements may involve a long strmg of acts. If so, these 
become instrumental or procedural questions according to 
Lehnert ( 1 978). For example, asking -omeone for directions to a 
house or how to cook coq-au-vin requires a listing of actions and 
instruments. In the Farmer and the Donkey story, this amounts 
to almost retelling the story in response to the question: 

35. What did the farmer do in order to get the donkey mto 
the barn? 

Questions on logical relations between events either assess 
or promp: the readers generation of text-connecting or slot- 
filling '-^-^e^.ccs The questions considered next also assess or 
pror/ .crrntial comprehension but they do so withiu 
<;en^enc<:.^. 

The first set of within proposition questions contains what 
I ehnert ( 1 978) classified as concept completion questions. These 
questions require that the reader search his memory or the text 
for a missing component. These questions basically interrogate 
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case rc!ations(agents, instruments, etc), of which someexamples 
arc: 

36. Who gave the farmer some milk? 

37. What frightened the donkey? 

38. What did the cat reply when the farmer asked him to 
scratch the dog? .. , 

39. What did the donkey refuse to do? 

40. Where did the farmer go to get the hay? 

41. When did the cat scratch the dog? 

42. To whom did the farmer give the milk? 
Question.s (36-42) are probe questions. These could also 

be written as forced-choice, (h\yunctive\(]uc$t'\ons such as, 

43. Did the farmer give the milk to the dog or to the cat? 
or as verification questions requiring a yes/ no answer as in, 

44. Did the farmer ^ive the cow milk? 

Disjunctive and verification questions are easier since 
they specify tiie conceptualization fully and rei^^uire only a direct 
match between what is in memory and what isin the question. 
Furthermore, since they do not require a search arhong a large set 
of alternatives, the guessing probability is limited to' one over the 
number of alternatives specified in the disjunction o^qne-haif in 
tiie case of two'^alternatives and verification questib.ns. The 
disjunctive, verification, and what Lehnert ( 1978) calls "feature 
specification" (e.g.. What color is the dog?) questions are clo^ie to 
wiiat is normally termed 'literal comprehension." However, this 
term is misleading since even the understanding of sentences and\ 
their translation into a conceptualization involve considerable 
linguistic, semantic, contextual, and intential knowledge. It 
would be better to call these questions text-constrained and 
within pi-opositional, given the theoretical framework used here. 

Two other kinds of questions which cover several events 
that are inferential in nature also deserve mention. Both are 
judgmental in that they involve internal scales, one using social or 
personal opinion criteria, the other using quantification. For 
example, 

45. What should the farmer have done to persuade the 
donkey to get into the barn? 
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calls Tor an opinion and Tir the generation ofan alternative goal 
plan. The question involves an evaluation of what the 
protagonist did. While morality does not enter into this example, 
moral judgment questions are similar in form to (45), 

The second type of question calls for quantification and 
entails knowledge of classes and class-inclusion relations or an 
underlying scale for a state. For example, 

46. How many animals were there in the story? 

47. How badly did the farmer want the donkey to gel into 
the barn? . 

48. How did the donkey feel? 

Can We Promote Comprehension through 
Asking Inferential Questions? 

There has been a long history of study on whether asking . 
adjunct questions before, during, or after reading helps reading 
comprehension (Anderson & Biddle, 1975). The an.swer seems to 
be that such questions may help or hinder and it is not clear as to 
why. Another question arises as well, namely whether we 
promote or assess comprehension via questions'? One problem 
with prior research on this question is that the questions used 
were generated largely on intuitive and informal grounds and did , 
not follow from a model for language comprehension. In this 
section, we shall explore some implications of the causal-chain 
theory. 

According to the causal-chain model, the reader under- 
stands a narrative by 1) f orming conceptualizations of sentences 
and 2) linking conceptualizations by generating inferences which 
connect thern. Once the causal-chain is represented in memory, 
the reader is said to have understood the narrative and can now 
perform additional operations upon this representation by use of 
various interpretive or summarization or story grammar rules. 

The formation of the underlying conceptualizations 
appears to be a necessary pre-condition to connecting them. 
Thus, developmentally, one might expect individual sentence 
comprehension to precede that of linking sentences via 
inferences. This, in fact, appears to be the case. Omanson, 
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Warren, and Trabasso (1978) assessed within proposition 
comprehension of stories by five- and cight-year-oId children by 
the use of concept completion questions. Then, they asked the 
same children to make logical inferences via the use of causal 
antecedent and motivational questions, the inferences involving 
the linkage of the same propositions which they had probed with 
concept completion questions, The five- and eight-year old 
children were matched on how well ihcy answered the concept 
completion questions and then were compared on how well they 
answered the inference questions. The data showed iwothings: i) 
as the children more accurately retrieved concept completion 
information, the percentage of correct inferences also increased 
a result in line with the assertion that conceptual understanding 
underlies inference generation but 2) the older children 
generated more correct inferences despite the fact the two age 
groups were matched with regard to their memory of the 
propositions upon which the inference was based. Thus, finding 
relations between conceptualizations increases with age, inde- 
pendent of the ability to form the conceptualization. 

Returning to the question of comprehension assessment 
or promotion, we can no*>v examine the possible inlTaence of 
v/ithin and between conceptualization questions. In particular, if 
the reader is asked concept completion questions (who? whom? 
what?) after each action in the Farmer and the Donkey story, we 
can assess how well the reader understands individual propusi 
tions. \i is possible that such questioning could promote sentence 
comprehension but not promoic linking conceptualizations 
across sentences. In contrast, vvc could ask inferential questions 
(why?) which assess the reader's comprehension of relations 
between propositions. It is possible that questions which require 
the finding of logical lelations between events during reading 
could promote comprehension and memory by establishing more 
links in the causal chain. 

\ Wimmer (1979) has performed a pro\'t)cative study on 
these questions, using the Farmer and the r3onkey story. 
Wimmer studied how well four- and eight-year-old children 
could answer questions while listening to the story and also how 
well they could later retell the story. He asked different groups of 
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children why questions and whO; whom questions after each 
action in the story. (Unfortunately, no control, group was run 
where no questions were asked so we can t assess the effect of 
questions per sc.) 

Apparently, comprehei^sion, as assessed by immediate 
recall ol'thc story, was not affected by the kind of question asked 
sine: the respective percentages of propositions recalled by the 
why and who 'whom groups were 3^;] and 39. On this measure, the 
kind of question asked did not aid comprehension, i.e., the 
construction of a better memory representation. (Perhaps 
delayed recall would have been more sensitive to ({uality of 
the representation.) 

However, the why questions seem to have assessed the 
children's ability to construct a cau^il-chain representation 
better than the who whom questions. First, the correlation 
between accurticy on the why questions and recall of the story 
was significant and higher than that for the v/ho/'whom 
questions. The respective correlations were .77 (p< ,01) and .40 
{p>A)5). However, since the level of performance on probe 
questions for the who; whom group (86 percent) was higher than 
that on the why questions (63 percent), the differences between 
the correlations could have been a result of restriction of range 
rather than question effects. 

Another analysis, however, ,suggesis that the why 
questions assessed individual differences in comprehension 
better than the v, ho; whom questions and supports the assump- 
tion that understanding the concepts within a sentence precedes 
understanding of relations between sentences. Wimnicr com- 
pared those four-year-old children who answered all questions 
co rectly on their ability to recall the story. While the number of 
subjects' was small, those children (/; = 4) who answered all the 
why questi.\ns recalled KO percent of the story propositions and 
those ( // = (S) who answered all of the who /whom questions 
recalled 46 percent. Further, age differences in recall were nearly 
climinaud when the four- and eight-year-old children were 
matched on answering why questions, here the respective 
percentages (and numbers) were HO percent (/7>-4) and 93 percent 
(n 17).^ 
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Thus, wc luive some indication that children understand 
individual sentences herorc they connect them iiifcrcntiaMy and 
that understanding of the logical relation'^; between sentences 
leads to better retention of a narrative. 'I'he question as to 
whether questions promote comprehension and which questions 
one should use remains unanswered by the two studies discussed 
here. 

One goal ol" the above presentation has been to provide a 
framework in which to assess reading or listening comprehension 
via questions. The types of questions asked are systematically 
related to the types of relations that exist between states or 
actions in a narrative. The adv antage of the present approach is 
that it indicates tli o kind of processing required by the reader in 
understanding concepts and relations between concepts in 
stories. Since teachers try nearly exclusively to use questions as 
their main means to assess comprehension (Durkin. f977). a 
framework for syst-'matic question asking which either promotes 
or assesses comprehension should prove to be a useful aid! Basic 
research on the value of systematic and theory-based questioning 
should also evaluate the usefulness of such procedures. 
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Vocabulary Knowledge 



Richard C. Anderson 
Peter Frcebody - 

University of Illinois at Urbana-Champais^n 

Our aim in this paper is to summarize what i^* known about the 
role of vocabulary knowledge '\n reading comprehension. 
Though word identification skills are important ii reading, this 
paper is concerned exclusively with knowledge of word 
meanings. An assessment of the number of meanings a reader 
knows enables a remarkably accurate prediction of this 
individual's ability to comprehend discourse. Why this is true is 
poorly understood. Determining why is important because what 
should be done to build vocabulary knowledge depends on why it 
relates so strongly to reading. The deeper reasons why word 
knowledge correlates with comprehension cannot be determined 
satisfactorily vvithout improved methods of -slimatingthe size of 
people's vocabularies. Improved assessment methods hing :. in 
turn, on thoughtful answers to such questions as what is a word, 
- hat does it mean to know the meaning of a word, and what is the 
.nost efficient way of estimating vocabulary . size from an 
individual's performance on a sample of v^ords. 

, Vocabulary Knowledge and Linguistic Ability 

PJleasures of vocabulary knowledge are potent predictors 
of a variety of indices of linguistic ability. The strong relationship 
between vocabulary and gener: ' intelligence is one of the most 
robust findings in the history of intelligence testing. Termaii 
(1918), for instance, reported a correlation of .91 bc ^ veen mental 
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age (as asses .ctl b\ the Slanlbrd Revision of U Binci-Sinuin 
Scale) and Ihe vocabulary subsealc. On this basis he suggested 
thai the vocabulary measure alone constitutes a good estimate of 
performance on the enfiro scale and thus could be used as a short 
measure. !• 5nce then, this suggestion has-been tested wi, ;i various 
age groups/Table 1 summarizes representative evidence. Inthe.^c 
studies, correlations between vocabulary subtest ^ \)res and total 
test scores on a number of different iq and achievement tests have 
ranged from .71 to .98. ■ 

An equally consistent finding has been that word 
knowledge is strongly related to reading eomprehension. Davis 
(1944a. 1968) faetor ^inalyzed nine comprehension tests and 
found a main factor for word knowledge on whieh :i voeabuiary 
test loaded about .8. Vluirstone (1946) rcanalyj^cd Davis\originai 
data and found three major U\ ors: vocabulary knov ;:dgc, 
ability to draw inferences from , aragraph. and at^ility to grasp 
the main idea of a paragraph. . ii the years that followed, several 
factor analytic studies identified a ''reading eomprehension'' 
factor (Bot/um, 1951; Clark. 1972; Friichter. 1948; Wriuley. 
Saunders & Newhaus. 1958). The range of faetor loaciin;- > lor 
vocabulary tests in these studies was t! to .9? 

'f'his strong relationship has be.n found tJ i;old aerosN a 
wide range of language groups. T'horndike (1973) ^-ollcetu^' d-.;'.a 
from over 100.000 students from 15 eountrics. aci .ss throe a.^ : 
groups: he found median correlalio beiVvcen vocabula.-; 
knowledge and reading comprehen:.;on, corrected io ' test 
reliability, of .71 (10-year-old:'). .75 ( 14-ycar-olds), an.: .66 (17- 
18-ycar-olds). Thorndikc concluded that thr results '.ndieatc 
''how completely reading performance is '?ote niined ny wr J 
knowledge at different levels; and in different .unitries" ( p. 6/^ 
^ he uncorrected eorrelat )'ns arc reproduced n 'ral)le 2. 

Analyses of rcadainlity (ef. Boi rniith, i )66) alsc; deUK^M- 
strate the prcc'r-nent rolQof w- ^ knowledge. lu a study ofJ:- 
factors that make prose difllcuii tr cad. Coleman (^^^71) 
cxannned -'lorphologieal. syntactic, ami semantic properties of 
words 'ukI sentences. While he fcnind sentence complexity to he a 
fairlv importaiu variable, he was able to conclude that "any 
:ncasurc of word complexity (number of letters, morphemes, or 
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Table 



Correlations of Word Knowledge with Reading Comprehension 
in Fifteen Countries 



Age Group 



Country 



10 yrs. 



14 yrs. 



17 yrs. 



England 

Finland 

Hungary 

India 

Iran 

Israel 

Italy 

Netherlands 
New Zealand 
Scotland 
Sweden 
.United Stat 



Belgium (Fl) 
Belgium (Fr) 
Chile 



.537 
,588 
.543 
,735 
.617 
,594 
560 
498 
651 
580 
620 



.591 
.619 
.508 
.698 
.654 
.533 
,387 
,427 
.674 
,587 
,624 
,685 
,7If^ 
,598 
693 



.500 
,481 
,577 
,497 
395 
389 
320 
294 



,716 
559 
1?^ 



.446 
.310 
,536 
,579 
,584 
,679 



iVoie. I'rom 'I'horndi -j (1973). 

syllables; frequency of usage) will account for about 80 percent of 
the predicted variance'' (p. 184). Klare^(I974-I975), in a review of 
readability, also concluded that a two \ari:ibIc formuUV is 
sufficient for rrK)St practical purposes: one'variable relates to 
word difficulty and the other to syntactic or sentence difficulty. 
He went on to conclude that the word variables consistently 
more highly predictive of difficulty than is the sentence var iable. 
As would be expected, some index'of vocabulary difficulty has 
typically been given the heaviest weight in readability formulas, 

■J 

Why is Vocabulary Knowledge a Major Factor 
in Linguistic Ability? 

Th.re are three more or less distinct view.s of why 
vocabulary Lno\^ ledge is such an exirc nrci-nary correlate of 
linguistic ability. We will call the first the instrumentalist 
position; i:.div»'duals who score high on a vocabulary test are 
likciv to know more of the words in most texts they tiicounter 
tha. low scoring individuals. The heart of the instrumentalist 
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hypothesis is that knowing the words enables text ^^^^^^^'^^^'HeJ^"' 
sion, In other words, this hypothesis claims th^^ ^^/'^l 
knowledge is directly and importantly in the ^^''^ f^Hj^'^ai^^ 
resulting in text comprcb''nsion. 1 Inlikc the t^^'^ /:5^h|Mori^ 
described below, the instrumentalist hypothesis has ^ \0 
say about where vocabulary knowledge comes (rot^^M. V^^iy 
that, once possessed, it helps the reader understand ^j^ry 

According to ihe second position, Vocii^^^^t^el>^ l^.^l^ 
measure verbal apdtuctc, A person who - ores high ^^^^jT^ ^'Wc^^ 
has a quick mind. With the same amount of eNp°'^^^. tP^' 
culuire. this individual has learned more word inc^i"'|.,i^ 0< 
she also 'comprehends discourse more readily thi*^ j^^jn^ ^ho'^'^ 
wlio scores low on a vocabulary test. The cssemial ^^l^'^^i^^ tP^* 
aptitude hypothes' ; is that persons wnth large voC^*^ si^j^^ ^r^^ 
better ai discours ; c -mprehension because they po^^^^^in\, \\0^ 
mental agility. A ' ge vocabulary is not concciv^c] '^^J- ^^^vC^^ 
in a direct way in h-tter text understanding in this '^^^;^t>(^|Hhi''f' 
vocabulary test pcrlormancc is merely another **^j(;^^>k^^ ^' 
verbal abih.y and it -is verbal ability that muinlV ^ ^Mn^'^ 
whether text will be understood. r^^V|. 

. iie ihird position is the kn(r-^ ku/,u:c hypothcsi-''^ 
ance on vocabulary tests is seen as a reflection of ^•^tij^i^ q ^^H 
exposure to the culture. Thr person w ho scores h\&^^ //^^ ^^^p^/ 
and broader knowledge ol the culture. The essential 
is this knowledge that is crucial for text understaH^''^^ n^^^,;hh^^ 
than being directly important, possessing p •ertaiii /iO\^'|\\ifi? 
is only a sign that the individual may possrss th^^ ,,fi^ i^^'d^^' 
needed to understand a text, f-'or instance, the chiU' 
th- word mast is likely to ha.e knowledge abo,,;: '''^ ( <^^^K l^i*^ 
knowledge enables that child to understand a text ^^^^l^'.js^ 
sentences v. hich do not even involve the word /;?as7. )i:Pil.^,''vV^ 
jibed suddenly and the boom snapped across the 

(){ course, jihc, hoonh and cockpit are spcci^i'^j^VlA^^^^ ; ' 
too. it might be wondered wheihei the instruments'^' /^^VV-.''^ 
and the knowicdge hypothesis are leally diiY^i"C^y[^a^^\o 
^'crsions the two positions are distinguishable. ^f - n^OJ^^ 
instrumental position, as we ch(K)sc to charaetci'i'''^^^P^As^*''^ 
individual word meanings. The knowledge vic\V ^ \yC^^ 
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<-*(>nccptiial frameworks or *\schcmata'': inciividiial word mean- 
ings arc merely tiie expensed tip of the eonccptual ieeberg. 

Whieh ol" these three positions is most tenable? The main 
point to be made is that there are neither the theoretical tool?; nor 
the data to justily a eoneliisior. at the present time. A second 
important point is that it would be naive, indeed, to assume that 
one ofthc ptr;." '.ions will turn out to bcentirely right and the other 
two entirely wrong. 

The most fully developed position is' that vocabulary 
lMit)\vledge retlects verbal aptitude. As the studies reviewed 
earlier indicate, voeabularv tests intereorrclatc highly with a 
varict\ of other kinds ol'tcsts rellecting "intelligence.'' On its lace, 
this fact is hard to understand solely in terms of the 
instrument-.list or knowledge positions. Probably by melaphori- 
eal extension ol" notions ol* physical agility, it is customary to 
speak of people of high iniel!'. 'cnce as having "quick'' minds, 
Kcccmtly Earl Fhint and his associates have been trying to prove 
that this is more than a -vctaphor (cf. Hum. 1978), They 
lljcori/-.i that people ot high verbal ability are literally faster 
than v:?her people at elemental .verbal cc^Mng :• reeoding 
opcraiions. One task used to assess speed ci mc.al operations 
dc\'c!oped by Posner (cf. Posner& Mitchell. 1967) involves the 
subjects' deciding whether pairs of upper or lower ease letters 
match. In one c^>ndition, the stibjcct has tc^ judge if two letters 
have the san-.e name (e.g.. aA) and ir the other condititm, the 
decision is whether or not theletters are phy^* :allv' identical (e.g., 
AA). The subjects' responses arc timed. It is argued that a time 
measure derived frcmi this task is a pure ir Jex ol lU" speed of 
^ome elemental verbal opcrati^ms. ;.ioc;: the siibjtXi needs tn 
"look : p" in memory the nair'*^' of the two leticrs and compare 
h'Jm. Ifunt and his collaborai'^^s have fouwd that this measure 
ciM relates about .30 wiMi standardized tests of \crbal ability. 'I his 
is a relaticMiship that coi ' l nr)t have been, predicted and is not 
r"eadily explained by the other hypothesc:; being entcrtainc-T 

Nevertheless, the case is far from cnn'clusivc. The general 
abijUN' rests used in Hunt'ssUidies proh-'.hly placed subjects undei 
5east some implicit time pressure. This cou^d have given fast 
Workers an advantage. If so. the studies ma\ have re'vealed that 
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fiisi people arc last rather llian that last people arc smart. 
Consistent with this interpretation arc the results of a factor 
analysis of representative paper and pencil ability measures and 
laboratory reaction time tasks completed by Mnnt. Limncborg, 
and l,ewis (1075). "fhc measures c»f speed o\ really elemental 
processes, such as letter matching tinu\ loaded on a Lector that 
appears to represent clerical speed and accuracy instead of on the 
factor representing general intelligence. A study, of Kirby and 
Das (1077) also indicated that processing speed is a separable 
factor in tests nf verbal and spatial abilities. This conclusion 
seems to be a sound one since Thor-ulike (1973) found, in his 
study of F ading comprehension in l:> countries, only modest 
correlations between' performance on reading speed and 
comprehension tests. I he median corrected correlatitins were .42 
for lO-year-olds and .47 for 14-vear-oids. 

With respect to the ii.^trumentalist p osition, as the 
evidence reviewed earlier indicates, word difficulty is highly 
predictive of readability. Does this fact clinch the argument in 
favor of the instrumental hypothesis? No, since it is possible that 
variation among texts in vocabulary difficuity is mcreiv 
symptomatic of deeper differences in knowledge prerequisites. 
To prove that knowing ^hc mrnning of individual words has an 
important instrumental roie in understanding text \vo\:' ! require 
uore than correlational evidence. K would need to be shown a) 
that the substitution n\ easier or more diificnit words in a text 
makes that text easier or more dilTieult to comprehend, and b) 
that people are heiped to cor; ^ rchend a text il* t' cy learn the 
meanings of the unfainiliar words it contains. A cursory look ai 
the literature bearir.g on these points suggests that the 
assumptions of the insirumentaiist position ore unquestioned 
tenets rather than hypotheses in need of veriiici ion. 

There is Si^ip.c research in whieh texts \\rvc been :ilicred so 
as to vary word famiHarity (see Cha.i. 1058. for a review of the 
early studies). In a recent set of experimcnis, Witlrock, Marks, 
and' Doctorow (1975: sec also Marks. Doctorow, VVittrock, 
1974) replaced 15 percent of the words in several passages with 
either high-frequency or !ow-freciiicncy synonyms. 

1 here is some confusion in the Wittrrick et al. paper o\ er 
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thcf word IVcqu^-ii^^-- j^anip^^iations. 'vVhilt^ ihis detracts rronithe 
findings, the conclusions may hold for ^'ca:.y" and ^^hard" words. 
Sixth graders of every reading skill evidenced better 

comprehension of ^^Sts ^^ontaining ^"^^y words than texts 
containing hard Woj.js, wlicthcr they were reading or listciMng. 
Furthermore. chiUlr^i^ vvho began with an easy text latei' showed 
improved comprche-nsion f>f the hard version of the same text. 
•Performance on a voca^^^''^^^'>' ^^^^ suggested that children who 
had First received th^^. ^^nsv version of a passage were able to learn 
some of the low-IVe^^^jL-ncv words in tlic hard version. 

CXlicr recent ^.vid '♦''Cc is less ia%'orahle to the instrunicn • 
talisi position, iuinni'^" ^^^'^^ Brady (1^74) were unable to 
increase fourth, i'irth,^ rtnd sixth grade students' cdmf ehension of 
texts that contained -i substantial proportion of cillncult words 
by direct i; ' ' • ''nion ^''^ those words. e\cr though such 
instruction .'''':"'''^canfly increased the students' performariee on 
the \-ocabulary 'tcD^j^ themselves, 'f hesc authors eon'.-luded that 
the instrumental hvpf^thcsis seer-s be ruled out.J. -^ir:-. ^\i,iy. 
^ind Schreck ^cc ^i'-^^^^ ''any & Jenkins. 1977) were also 

unable to establish that vocabulary instruction improves reading 
compr.Miension. Se\xM-a! dilfereni jnctlmds for tcacliing word 
mean.ngs were cxpi^^p^^d. Aii wer^ at least somewh:it better than 
no instruction. The ni^tbod which proved most eflectivc with 
both average ^^^^d learni'-J^ ^^isablcd chiid'"'^'^ in\'olvcd i'"-nsive 
drill an(! practice Oh ^\,c words in isolation. r!owc\er. even when 
children had delinit..iv learned the meanings oT twelve difficult 
•^vords. they did bc'^^cr than uiiinstructed chiklicr* who 
definitely did not know these words on a cln/etest or in rrTcllinga 
brief siorv c^mtainin^ the twr]\ e difficult W(V.ls. We do not know 
how to ' reconcile "^he eonflieting restdts hearing on the 
instrumental h\ "'Ot|jj_.sis othr- than to conclude, as rc\'iewcrs of 
educational rescarct. nvj^sE so often conclude, that nicuc research 
is needed. 

Turning now to the third position. :here is r.oxv truly 
substantial ease n^.^j background k 'Wledge is crucial for 
rciiding comp^^^'^^^nsion (^^- ^^^derson. l*-^78). Howe\er. there is 
thin evidence to supp^^^^ ^'iew that vocr.Suiary scores 
priniarilv reflect sn^; [Kickground knovvledge/Wc shall cite inst 
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one study which sih^gcsts that the idea is plaUbibIc SlcffenoCn, 
Joag-Dev, and Anderson (1979) asked natives of the 11.5. and of 
India to read passages describing an American and an Indian 
wedding. The results showed that the native passages were read 
more rapidly and recalled in greater detail. There were more 
culturally appropriate elaborations of the native passages and 
more ciiliiirally inappropriate distortions of the foreign ones. 
The vocabulary of the two passages was closely conirolled. For 
i-nstance, there were only two words in the Indian passage, sari 
and (IhoiL referring to articles of women's and men's clothing, 
respeclively, thai would have bcc:i unfamiliar to any of the 
Amerirnn lubjects. These two words did not figure in any 
important way in the passage, so failure to know them could have 
had no more than a negligible effect. Still, n two item vocabulary 
test, examining knowledge of sari and cllioii, would have been an 
excellent predictor of performance on the Indian passage. All 
Indian subjects would have known both words. Some Americans 
woul'd have known r/r; but very few would have known dhoii^ It 
is apparent that the test .would have neatly divided .subjects in 
terms of the extent\)f their knowledge of Indian culture, which 
was ' iously the underlying reason for the large observed 
differences between Indians and Americans iri comprehensiorjv"' 
learning, and memory. / 

Instructional Implications of Different Hypothese^ ahont 
Vocahulary Knowiccii^e ' 

It is impoiiant to knovr which of the three hypotheses 
about vocahulary knowledge is most nearly correct because the 
views have r^idically different implications for the readmg 
curriculum. At one extreme, some who endorse the verbal | 
aptitude hvpothesis are fatalistic about whether a " environ- 1 
mental factor can have a major inilucncc on childrenVs reading. ^ 
They t: nd to recommend family planning instead of curriculum 
innovation as the final solutior r o the reading problem. Of course 
the w-bal aptitiuic pn;iriOn does not require the belief tliat 
her 'My is predornin.. - Alternatively, there are those who 
m- ain that verbal ability grows in proportion to the volume of 
experience with language. The greater the opportunities to use 
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aigiiagc the faster and move cfTiciciit become the elemental 
proecssing operations. In turn, speed and elTieieney permit 
greater benefit frorn each successive language encounter. More 
detailed accounts of 'his sort of position can he founcf in fhewell-- 
known papor by Laliin gc and Samuels ( 1974) and a recent paper 
hy PcrfcnJ nnd. i.c-^-v)ld : ii>79). 

* ho latter lormulation of the verbal aptitude hypothesis 
Icuds 10 the rceonuncndaiion that educators should try to 
maximize the amount ol* reading children do. Hovvcvcr, this is 
not very newsworthy. It is a practice thai vvoald be endorsed no 
matter what the theoretical persuasion. The chstiuctivc ernpliasis 
in the verbal aptitud' position is on speed and efficiency of 
processing. 7'his emphasis gives rise to the recommendation that 
beginning readers and poor readers receive extensive drill and 
practice on ' J'undamcntals" of reading, Accordiugto Perfctiiand 
l.csgotd (1979), the cnll activities should include even more 
practice than typically provided in word vocalization, more 
practice in speeded word recognition, and more praciice in 
immediate memory for the literal content of uwt. It should he 
noted that these suggestions are offered i;' the ,'irit oj a 
hypothesis. Perfetti and Lesgold acknowledge that, so far at 
least, attempts to facilitate text comprehension by providing 
speeded word drills have not proved very successful (sec 
especially Flcisher & Jenkins, i977). 

While, hke everyone else, the adv(Kate of the instruiricntal 
h> pot lies is fa\'ors lots tM reading and varied language experience, 
the distincti\'e feature of this view is that it invites direct 
vocabulary building cxercL^es. Becker ( 1977) fjas ar**'"d strongly 
for the instrumentalist positid-^ He maintained that (MTCC 
decoding skills have been mastered, the chiei* remaining factor in 
dctermming whciher a child will he a successful reader is 
vocabulary knowledge. Me claimed that schools have never had 
reading programs that systematically build vocabulary. Children 
from middle class backgrounds pick up word meanings anyway. 
But the same is less true, Becker argued, of children coining f-i.ii 
!ov;er cI'ij.. homes, which often fail to provide support for the 
continuous vocabulary and eoiv -nt growth important to school 
work. Consistent with this as: ...iption is some recent wc^rk by 
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Hall and Tin'c (197i)). win) iniiiul that lower clas,, parcnls. 
particularlv lower class black parents, use substantially fewer ol 
the words found m standardized intelligence tests when speaking 
with their children than do middle class parents. 

Becker proposed a reading curriculum in which every 
child would learn ab-uit 7.0(J0 bnsic words from direct 
instruction. The figure 7.000 comes f'om one estimate ot the 
numbc of basic words knon'U by the a crage high school senior 
(Dupuy 1974). Becker ackno'vlcdged ■ vM there arc laniihcs ol 
wor.f" with related r .-anings. thereby , rmivting the child .some 
got: • rilion beyond the words that are specifically taught. By 
and kn ac though, he bclicved that learning one vocabulary item 
gives litUc advantage in learning the next one. For instance, he 
illustrated morphological instruction on the following .set. of 
unrelated .vords: help, ipport, insis' nl. resist. rcn>gni=c. 
assist r%cn his so-called -'concept side" of the instruction 
entailed a component analysis of isolated words. So it this 
assumption is correct, direct teaching of a \ocnbulary o\ c^■cn 
7,000 basic word:-: would be an enormous task. Becker estimated 
that about 25 basic words would have to be taught per week from 
the third through the twelfth grade (p -^0). 

The distinctive curriculum imp; .ion of the knowledge 
hypothesis is that cencrallv new voc ! alary ought to be learned 
in the context of :icquiring new knowledge (of. Goodman, l')76. 
p 487) t'vcrv serious student of reading recognizes that the 
si.mificint aspect of vocabulary dc>. clopment is in the IcaVnmg of 
concepts not just words. The additional point that the knowledge 
position brines to !he fore is that concepts come in clusters that 
are systcmaticallv interrclatcd.'Rcturning to an earlier example^ 
the concept of nuisi cannot be acquired independently of 
concepts such as boat add sail. Thus, it would seem to be sensible 
for people to learn the jargon in the contest ol learnin- about 
sailine and the anatomy of sailboats. According to the knowledge 
hypot^hcMS. if a child were really naive, trying to teach a single 
sailing concept an'A word in isolation from the set of related 
cnni-pis and words would bc-incfficicht in the best case and 
completely fruitless in the worst case. 

' A thought experiment suggests the more genc- ji ponu 
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a[)(Hil the role of knowledge in \ ()e:ih\]lary learning. Snppose vou 
wished to tcaeh sfune Frcneh vocabulary to. let us say. two 
groups of linglisli-speaki \i: C'nnadian children, evenly matched 
on aptitude and achievement. One jMOup is from a downtown 
urban ar6^i, the other is from a sn^all fishing village. The body of 
words yon wish to teach is concerned with nshi!iu(//Y/n /^•/^\. nnls. 

castini:. hai), currents, etc.). Would you expect one group to 
learn the words more quickly and easily than tlu* other? Why'* We 
do not know of research that has dealt systematically with these 
cjucstion*.. On«; ^<nncwhat relevant srudy was carried out (\v Allen 
and ^inrion f 196^': The\ found tlia' pHysics 'undents were much 
bclte' . "a;: students in rccogrii/iag physics W'fids. They 
co: -icd f' -i(. for art students, physi-'s wcM'ds arc scmrMuically 
inu ^ct and vhus tiave to be fcrogni/cd on a moi'c piecemeal 
' ;^-.;l;.uity with an area of kno^vledge increased the 

I of the physics words. 

nowledgc can be sliced in \arious ways. Tin's far in this 
, . - . wx \\y • considered sets of words related because they are 
I ^.''.! udking about the same topic, Wortis may als(^ be 
cfjnf^ pUi.in/cd in terms o; families related to one another 
ause they convey related sets of distinetions. Consider an 
'\ Vi\\)\c involving verbs of visual perception.* The basic verb is 
sec. If you notice that l(u)k involves a deliberate act o^' seeing, it 
can then be appreciated that glimpse refers to a short act of seeing 
wbcr'/as ^Vr//7cc refers to a short act oi'looking. Stare on tiic other 
hand, refers to a prolonged act tif looking, 'f he \ ariations in sense 
amonu these verbs can be understood in terms of just two 
semantic features, intention and durati(wi. I'unhcr distinctions 
would be required t(^ encompass other \*erbs of visual perccplitm 
such as notice and examine. ' 

, We would ccmsider that a lesson that helped children 
sharpen and extend the distinctions involved in visual perception 
words to .be consisr^ nr with the spirit the knciwiedge pc^sition. 
V'hal the knowledge position wouhl not countenance is a 

parate voeabniary lesson thai included ej uice. mast, and 
miscefrany' of orhcr wcrrds. ficrein lies a difTcrefiCe frc':*; the 
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insirumerilalisl position, which does not seem to us to preclude 
exercises involving lists of unrelated words, 

Johnson and Pearsons (1978) book, TeachiniJi Reading: 
Vocabulary, appears to represent predominantly the knowledge 
position, though it is an eelectie treatment that also rellects 
influences from the other two views, Johnson and Pearson 
advocated teaching a basic sight vocabulary using "intensive 
direct instruction in the early grades and with older ehildren who 
do not read well!" (p. 28). They also endorsed both direct and 
indirect means for tqaching phonies, promoting morphological 

\inalysis, causing voeabulary knowledge to expand, and teaehing 
tiie v.se of ahr dictionary and thesaurus, Johnson and Pearson 

" devoid a chapter to the use of contextual clues.to figure out the 
meanings of unfamiliar and ambiguous words. Otherwise most 
of the exercises and games ^'suggested throughout the book 
involve sets of words outside.,the'context of stories or textbook 
chapters. However, the words usually involved ,sets of interre- 
lated distinctions, such as were illustrated above with verbs of 
visual perception. Almost every activity was designed to expand 
children's sen,sitivity To these di,stinctions. There is an apparent 
discrepancy between the goals of the activities, which are 
concerned with conceptual distinctions and relations, and the 
format of the activities, which is ba^:ed largely on isolate^ words. 
If the knowledge perspective wcrestrictly adhered lo, voeabulary 
instruction would not be thought of as a separate subject in 
school. . ' ' , ' 

^ * For the sake of clarity of cxpositic')n, we have presented 
the aptitude, instrumental, and knowledge positions in uneom- 
plicated and somewhat overdrawn form. We must emphasi/e 
again that no serious scholar in reading or related fields rigidly 
adheres to any one of these positions. In particular Hunt, who 
has been identified with the aptitude hypothesis, has explicitly 
and emphatically suued that vocabulary si/e also is a reflection ol 
an individual's accumulated knowledge of the world. Becker, 
whom we labeled an 'instrumentalist, heartily endorses some of 
the implications of both the aptitude and the knowledge views. 
Reading has been a fractious field: if a policy were followed of 

^ avoiding controversy where none genuinely exists, the q'lality ol 
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intellectual exchange aiul the sociopolitical climate might 
improve 'to the point where someone within the next decade 
could write a book entitled 'M, earning to Read: The (ireat 
Consensus.'] 

What Does It Mean to Know the Meaning of a Word? 

It is not clear that, if Ludwig Wittgenstein and Bertrand 
Russell were left alone in a room for three hours,- they could 
decide that they really knew the meaning of dog. As Labov(l973, 
p. 341) said, "Words have often been called slippery customers, 
and many scholars have been distressed by their tendency to shift 
their meanings and slide out from under any simple definition," 

An ordinary adult engaging in an ordinary conversation 
will be absolutely sure he knows the. meanings of almost all of the 
words he hears. Notice that the restriction to ordinary use is an 
important aspect of this confidence. Consider the term gold, for 
example. The person who is sure he knows the meaning of this 
word in an ordinary use will quickly retreat when in the company 
of jewelers, mining engineers, geologicaLsurvey assayists, or 
metallurgists. 

V/hat does a person know when he knows the meaning of 
a word in its ordinary, everyday, garden-variety sense? This issue 
is addressed in what we will refer to as the Standard Theory of 
semantics, according to which the meaning of a word can be 
analyzed into features (also called components, attributes, or 
properties), each of which represents one of the distinctions 
conveyed by the word. Necessary or essential features are usually 
distinguished from features that are merely characteristic. For 
instance, having a back could be said to be a necessary feature of 
chair since an object that is otherwise a.chair except for the lack 
of a back is really a stool instead of a chair. On the Gcher hand, the 
ability to fly is only a characteristic feature of bird since some 
birds (penguins) don't fly at all and others (chickens) do so very 
poorly. 

To define a term, in the strong sense, is to list the features 
necessary to capture' the essence'of the thing (or event or quality) 
designated by .the word. Saying this another way, a proper 
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definition indicates the attributes a thing must have in order to be 
designated by a word; if any of these necessary properties were 
missing thani^gjrd would not apply. Before we choose this as our 
criterion in the testing of children's word knowledge, however, 
we might wish to examine how well it applies to adults' normal 
use and understanding of words. ' . , 

How able are people to define the words they are sure they 
know? ''Not. very" is the answer if one insists upon the strong 
sense of define. Consider gold again. Upon being asked to define 
gold, the ordinary citizen might say that gold a) is precious, b) is a 
metal, aqd c) has a iparticular yellowish (i.e., golden) hue, 
The problem is that none of these is a necessary feature. Not all 
gold is a golden color. If, say, the Chinese were to discover a 
mountain of gold, the substance would no longer be precious. 
Not even the attribute of being a metal can be considered to be an 
•eternal, immutable property of gold for, unlikely though it is, 
th^re might be a. scientific breakthrough in which it was 
discovered that gold is not a metal. 

A unicorn is a beast with such and such defining 
characteristics. Of course there are no beasts with these 
properties; which is to say that unicorns do not exist. By the same 
logic, if being precious and being a metal are defining features of 
gold, it follows that if the Chinese were to discover a mountain of 
the substance or scientists were to determine that the substance is 
not a metal, one would be forced to conclude that gold did not 
exist. As Putnam (1975) has noted, this is a very odd conclusion, 
because there would still be this "stuff lying around that people 
used to call gold. We have a right to be suspicious of a semantic 
theory that backs us into such a peculiar corner. 

Another example will illuminate the point even more 
starkly. WhenJt comes to fine points of meaning, ordinary folks 
. tarn to experts as the final arbiters— to jewelers and metallurgists 
for the exact meaning of gold, to the Supreme Court for the 
proper interpretation of words in the Constitution, and so on. 
For the sake of the argument, it may be supposed that the 
American Psychiatric Association is the final* arbiter of the 
meaning of homosexual: For years, this august group defined 
homosexuality as a disease of sexual orientation. Recently, 
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however, the association declared that homosexuality is not a 
disease, Anita Bryant may not have agreed with that conclusion, 
h\it at least she understood it. If ^ the characterization of 
homosexuality ds a disease had been taken seriously as a defining 
feature, upon reconsidering its position, the American Psychiat- 
ric Association would have had to assert, "There is no such thing' 
as-homosexuality!" That conclusion would simply have left Ms, 
Bryant puzzled. 

There are other serious problems with Standard Theory, 
Notably, the members of a class called by the same name 
frequently dp not all share a single set of common properties, 
Wittgenstein(I953; see also Rc^ch, 1973; Rosch & Mervis, 1975) 
argued that things designated by the same word generally are 
related by "family resemblance," He intended an analogy to a 
human family whose members look and act alike. Mother and 
one son may have a prominent nose. Father and daughter may 
have the same hair color. And so on. But there may be no single 
respect in which they are all alike, no single feature which they all 
share, Wittgenstein claimed family resemblance was the most 
accurate characterization of the relationships among the various 
uses of most common words. To illustrate his p'oirit, he analyzed 
uses of the term game, noting the similarities and differences 
between team games, board games, and child i en's games. Others 
have shown the fuzziness and context sensitivity of the meanings 
of terms such as cup (Labov, 1973); eai (Anderson & Ortony, 
1975); red (Halff, Ortony, & Anderson, 1976); and held 
(Anderson, Pichert, Goetz, Schallert, Stevens', & Trollip, 1976), 

A great deal more could be said about semantic theory, 
(For authoritative, current treatments, see Clark & Clark, 1977, 
especially chapters 1 1-14; Fillmore, 1975; and Miller & Johnson- 
Laird, 1976,) The main point of this brief excursion into the 
meaning of meaning is to caution against holding up a standard 
of word comprehension/or children that adults could not meet. 

Depth of Word Knowledge 

It is useful to distinguish between two aspects of an 
individual's vocabulary knowle'dge. The first may be called 
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"breadth" of knowledge, by which we mean the number oi woras 
for which the person knows at leas' some of the significant 
aspects of meaning. Later sections of this paper will be concerned 
mainly with breadth of knowledge. 

Treated in , this section is a second dimension of 
vocabulary knowledge, namely the quality or "depth" of under- 
standing. We shall assume that, for most purposes, a person has a 
sufficiently deep understanding of a word if it conveys to him or 
her all of the distinctions that would be understood by an 
ordinary adult under normal circumstances. 

Eve Clark (1973) has marshalled an array of evidence 
which shows that the meaning a young child has for a word is 
likely to be more global, less differentiated than that of an older 
person. With increasing age, the child makes more and more of 
the adult distinctions. In other words, when first.acquired, the 
concept a child has for a word need not include all of the features 
of the adult concept. Eventually, in the normal course of affairs, 
the missing- features will be learned. 

While there are some differences in theoretical interpreta- 
tion and some findings appear to hinge on procedural details 
(Brewer & Stone, 1975; Glucksberg, Hay, & Danks, 1976; 
Nelson, 1977; Richards; 1976), most of the research done to date 
supports the conclusion that there is progressive differentiation 
of word meanings with increasing age and experience. 

Just one illustration will be provided of the kind of 
• evidence that points to this conclusion. Centner (1975) 
completed a theoretical analysis of verbs of possession which 
indicated that huv, sell, and spend entail a more complex set of 
distinctions than give and take. Notice that giving involves the 
transfer of something from one person to another. Selling 
likewise involves the transfer of something from one person to 
another, but it involves an additional transaction as well, the 
transfer of money from the buyer to the seller. The complemen- 
tary relationship holds between buying and taking. 

Centner expected children to acquire the full, adult 
meanings of these verbs in order of complexity. Children ranging 
from four to eight years of age were asked to make dolls act out 
transactions from directions involving each verb. For example, 
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the children were requested to ''make Ernfe sell Bert a (toy) cai^C,' 
The four-year-olds performed flawlessly with directions eontain-\ 
ing^m^ and take, but never correctly executed instructions that 
involved spend hu\\ or selL The eight-year-olds exhibited nearly 
perfect understanding of every direction except the ones 
containing sell. Overall, the results were exactly as expected: the 
adult meanings of verbs of possession are acquired in order of 
complexity. 

Gentnefs analysis (1975, p. 242) of the children's errors 
suggests that the younger ones treated the complex verbs as 
though they were simpler forms. She explained, , , the 
commonest incorrect response was some form of one-way 
transfer ... the young child acting out buy and ,vW/ completely 
disregards the money transfer that should be part of their 
meanings, yet performs the objectr transfer in the correct 
direction. He reacts to /^i/rasifit v^eictake. He treats .vW/ as ifit 
were f^/v^." When asked to "make Bert spend some money," even 
the youngest child correctly handles the money transfer but 
neglects to have Bert get anything for the money he "spends." The 
child treats spend money as though it meant give money away. 

Through'some quirk of the sociology of science, the in- 
depth study of word knowledge has been the special province of 
psycholinguists studying language development in young chil- 
dren. There is a substantial body of literature on selected 
vocabulary of children from about two through eight years of 
age. The literature involvingolder children and adults is meager. 

In our judgment, vocabulary knowledge continues to 
deepen throughout lifetimes; that is, as they grow older, most 
people continue to learn nuances and subtle distinctions 
^conveyed by words that in some sense they have known since 
childhood. There is no hard data to support this conjecture. 
However, an illustration will show that many adults still have 
something to Iqarn about even fairly common words. It is easy to 
find educated adults who confuse infer ?ind imply,. A person will 
say something along the lines, "I intended, by stating these 
argument.?, to infer that Of course, this individual should 
have said imply. Speakers imply. Listeners infer. The complica- 
tion, which no doubt makes the distinction difficult, is that- 
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speakers may report inferences they have made as well as get 
imphcations across to hsteners. 

Breadth of Word Knowledge 

U is dis.urbi„8 .0 exami-e available «™f "^Jj"^ 
.ve.a,e .vocabula. .i.= or va.io. 

l^IT-Srw^^itLX'?. can be seen ,ha. .he es.i.aees 

"uLo. obvious how .0 evaluate the different sampling 
methods Z response eri.eria .^a. hav= employed 

— l^Sttlul^^^ 
Slated: 

Although the rapid rate of ^Vntactic acquisition h^^^^^ 

respectful discussion ^^^^^^l^^^^ l^iuL.n of average 
impressive. The best f'?"'" ava u ^ „ 

intelligence learn new words at a rate of "'"^e j , , 
necessary to assume therefore hat t a P ,. 

srn2:.^trytrrfe'd^:r:fro"j"^ 

Miller did not specify whether or not he was referrmg to 

srror,hi :iru and ^^^^--^j::^ 

seems unlikely Mi let X have been including 

summarized m i able J. ivimc / ^ dee, no 

compounds and ^envattves^ How^^^^^^^^^ o .nde stand' these 

. systematic exam,nat,on o cH,^d^ n s^^ ^.^^^.^^^^ 

First Tit ongt^^^^^^^^ 
po.nts. F.rst .n or gm ^...^ate estimates of 

. m-an argument about 'exica ue important element in 

„e growth of "-''>^™"' '"^ 

; SoX betweenlhet Second how do we assess what are 

' '^="'7:';^9:o"s«'hoTand Ee.erson remarked .ha. even 
.hough .he of vocabulary .es.ing is a"fairly old one (p. 35), 
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Table 3 

Some Previous Estimates of Total Vocabulary Size at Selected Grades 



Grade 


Source 




1st 


M.E. Smith (1926) 


2,562 




Dolch (1936) 


2,703 




Ames (1964) 


12,400 




M.K. Smith (1941) 


17,000 




Shibies (1959) 


26,000 


3rd 


Dupuy (1974) 


2,000 




Holley (1919) 


3,144 




Terman (1916) 


3,600 




Brandenburg (1918) 


5,429 




Kirkpatrick (1907) 


6,620 




Cuff (1930) 


7,425 




M.K. Smith (1941) 


25,000 


7th 


Dupuy (1974) 


4,760 




Terman (1916) 


■ 7,200 




Holley (1919) 


8,478 




Kirkpatrick (1907) 


10,666 




JBrandenburg (1918) 


11,445 




Cuff (1930) 


14,910 




Bonscr, et al. (1915) 


26,520 




M.K. Smith (I94I) 


51,000 


College 


Seashore (1933) 


15,000 


sophomore 


Kirkpatrick (1907) 


19,000 


Seashore & Eckcrson (1940) 


60,000 




Gerlach (1917) 


85,300 




Gillette (1927) 


127,800 




Hartman (1946) 


200,000 



i\ote. Adapted from Seashore and Eckerson (1940) and Bayor (1976). 



substantial problems of measurement remained. By now, in the 
time span of educational research, we might want to call the field 
"ancient," and virtually all of those original problems persist. 

There are important practical reasons for attempting to 
make accurate assessments of total word knowledge. Language 
and reading programs aim to increase students' vocabularies. 
The. number of words presented to students varies, in part, 
according to what is regarded as the most authoritative thinking 
and research on vocabulary size and grpwth (Clifford, 1978). 
More reliable estimates would indicate the appropriateness of the 
assumptions of a program, and perhaps highlight periods of 
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growth to be capitalized upon. More generally, reliable estimates 
would indicate whether direct language instruction can plausibly 
account for a substantial proportion of the child's language 
growth, or whether word knowledge is acquired for the most part 
independently of formal instruction. To refer again to a concrete \ 
proposal, Becker's (1977) idea that underachieving children 
should be taught via direct instruction the vocabulary most high 
school seniors possess would be difficult, but perhaps feasible, if 
the children had to learn 25 new words a week. It would be out oj 
the question if they had to learn 25 words each school day, 

Next we will present some of the central issues in broad- 
gauged measurement of word knowledge. The discussion of these 
issues will reveal many of the reasons why estimates of 
vocabulary size have fluctuated so widely. Two general questions 
need to to be considered. First, how is a sample of word.s to be 
selected? Second, what kind oi' response from a subject will be 
regarded as evidence that a word is in the individual's 
vocabulary? 

Selecting a Sample of Words 

In determining what is to count as a word, the researcher 
needs to decide whether or not it is of interest to di.sccrn the 
subject's ability to use. derivatives and compounds (plurals, 
participles, tense markers, comparatives, etc.). Some authors, 
notably Seashore (1933). have preferred to calculate separate 
estimates for "special" terms and derivatives. Others, for example 
Dupuy (1974). have attempted to concentrate solely on "basic" 
words. Dupiiv. theauthor of one of the mo.st recent and thorough 
studies of word knowledge, sampled randomly from HV/'.sYtr'.v 
Third New Inicmational Dictionary (1961) and then applied 
three criteria to each word selected. The word had to be a main 
entry, a single word form (i.e., not a derivative or compound); 
and could not be technical, slang, foreign, or archaic. 

The systematic nature of this sampling creates its own 
equally systematic bia.ses. Some children may have acquired the 
generative rule for, say. negation by prefix.(for example, imahle 
or cUslwnest) and others may not have (Silvestri & Silvestri, 
1977). Do we wish to exclude this element of vocabulary 
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knowledge from the measure? Adults acquire a number ofspecial 
or technical terms in their areas of expertise or interest, so 
exclusion of technical terms denies many subjects the Opportunity 
of indicating their knowledge of a large number of words. 

What counts as a word will depend upon the researcher's^ 
principal purposes. However, affixes and derivatives- are 
important elements of word knowledge, and several questions 
related to their role are of considerable interest: In what way does 
knowledge of basic or root word forms relate to knowledge of the 
compound forms? Are entries organized conceptually in the 
personal dictionary such that the probability of knowing a 
compound word is the same as that of knowing all its family 
members, basic form included? Or is the chance of knowing a 
compound some combination of the frequencies of the particular 
compounding elements? Much is.to be gained from research into 
these issues. 

Whatever criteria are applied, there can be no doubt tl\at 
there are many thousands of words in English, Dupuy (1974) 
estimated that there are about a quarter of a ?nilIion mainentries 
in Webster's Dictionary (1961). Of these, he calculated that about 
12,300 are basic words. 

A source and method of selecting from that source is 
required whi(ih will lead to the most accurate estimates of total 
word knowledge. The most obvious way to start is to sample 
randomly from an unabridged dictionary. Dupuy (1974), for 
instance, selected one word from every page of the dictionary (the 
third word' from the top of alternating columns), and then 
applied the three criteria mentioned earlier for selecting the basic 
words out of this group. This procedure produced a final sample 
of 123 basic words. 

Once a random sample of words has been selected, a test is 
constructed to assess how. many of the words a person knows. 
Then, in principle, estimating the person's vocabulary size is 
straightforward. For instance, Dupuy's Brisic Word Vocabulary 
Test contains I percent of the 12,300 basic words he calculated 
are in lVehster\s, Therefore, the absolute size of the basic word 
vocabulary can be approximated by multiplying the score on this 
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test by 100. A person whose score is 60, after correction for 
'guessing, would be judged to have a bifsic vocabulary of 6,000 

words. i ■ ^ ^ 

One disadvantage of this method is self-evident. Estimated 
vocabulary size depends heavily . on the si/e of the dictionary. 
With respect to Dupuy, while he sampled initially from a large 
unabridged dictionary, a word had to appear as a major entry in 
each of three other smaller dictionaries in order to be counted as a 
basic word. A total of 979 words, 41 percent of the sample, were 
discarded on the basis of this rule. The result was a very 
conservative estimate of the number of basic words in American 
English and is one reason Dupuy^s estimates of bdsic vocabulary 
size- are so much smaller than those of other investigators. Of 
course, many of these words were very rare, but others such as 
cloudlet, escaping, hreczes\ invited, starling, ixnd unilateral 
would be familiar to most people. ^ 

Already discussed is;the issue of what to cjo with derivative 
and compound forms. A liberal-policy will leadio large estimates 
of vocabulary size. A conservative policy will produce smaller 
ones. Dupuy was conservative. He eliminated 7.7 percent of the 
words in his sample on the grounds that they were compounds or 
derivatives, including a great many familiar ones, such as 
grandchild, package, and toothache^ 

There are other, more subtle considerations in Selecting u 
random sample of words from a dictionary. Some procedures lor 
sampling from an unabridged dictionary can introduce system- 
atic error since all entries do not occupy the same amount of 
space on a page. This disproportion typically favors the words in 
more common use since these are the most elaborated, 
particularly in an unabridged dictionary where very many 
derivatives may be listed (Williams, 1932; Lorge&Chali, 1963). 
Consequently, while the words may seem to have been randomly 
selected, the frequency distribution of the sample may be. 
substantiaDv different from that of the population. This may 
partly account for the very Marge estimates of Seashore and 
Eckerson (1940) and Smith (1941). # 

A further problem is that projecting a vocabulary size 
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from performance on a random sampling of words is inefficient, 
If the subject provides the meaning of bibulous, then using up test 
time by asking for the meaning of bicycle is wasteful. When 
estimating subjects' total vocabulary size is the researcher's major 
aim, then efficiency of items covered per unit of examinee time is 
an importanUconsidcration, 

One obvious response to these problems is to select the 
sample from a frequency distribution of words. Tcrman and 
Merrill (1937) arranged their sample of words in order of 
"difficulty/' When the subject failed at six consecutive words, the 
vocabulary test was stopped. Dupuy (1974) recommended a 
similar procedure. Time can be saved by such a procedure, but 
vocabulary size is likely to be underestimated. Furthermore, 
heavy stress is placed on the assumption that the frequency 
distribution of the sample mimics that of the population. If this 
assumption fails, then multiplication of the subject's scare by the 
appropriate con.stant will produce a poor estimate of total words 
known. 

The characteristics of the two major current word 
frequency compilations available (Carroll, Davies, & Richman, 
1971; Ku^fera & Francis,, 1967) suggest a potential problem with 
frequency sampling. These analyses indicate that the distribu- 
tion of words is highly unbalanced, a conclusion reached over 25 
years ago by Horn (1954), who calculated that about 2,000 types 
will' account for about 95 percent of "running words in adult 
writing"; 3,000 for 96.9 percent; 4,000 for 97.8 percent; and 
10,000 for 99.4 percent. At the low frequency end of the scale, 
there is a tail that approaches, infinity. Even in a huge corpus, a 
vast number of words appear only once, twice, or not at all. Of 
the 86.741 word types listed by Carroll, Davies, and Richman 
from a corpus of over 5 million tokens, 35,079, or 40.44 percent, 
appeared once. Kucera and Francis found 44.72 percent of the 
words appeared once in a sample of over one million tokens. So, 
if the test is short, the subjects run the risk of not being able to 
show that they know several medium frequency words, since 
there will be such a large proportion of rare words in the sample. 
A resolution of this issue is important, since a frequency-based 
sampling technique seems the most accessible method for ' 
overcoming the problems of simple random sampling. 
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^ Fre^quency is aparameter which probably is very strongly^ 
related to- ^robabflity that a word will be known. There is 
evidence supporting this hypothesis from .a number of areas: 
multiple choice performance on standardized tesjs (Kibby, 1 977), 
recall of word meanings following preser^r^tion of pictures 
(Carrol & White, 1973; Duncan, 1977)^d word rec9gnition 
times following tachistoscopic pres^tation (Cohen, 1976; 
Rubenstein, Garfield, & Millikan, 19?0). Thrf only discrepant- 
finding has been that of Davis (1944b) wtio^fotind only a slight ' 
relationship between word difficulty and frequency. He ex- 
plainea^this result in terms of the role of compound words: While 
the root of the word may he very common and well-known, a 
certain affix-root compound may be very infrelq^Jenl^t almost 
equally WU-known, if the affix 'is familiar. A more analytic 
approach to the relatronship of this index of frequency of usage 
to "probability of knowledge would- entail t)ie use of "family" 
frequency, that is, the i;5equency of the root word .a'tad all its 

"compounds and derivativ^. We might expect th^t the relation- 
ship of this index' of frequency of usage to probability of 
knowledge would be-more orderly. 

indeed, we^are'willing to go further and speculate that the 

-relationship betweerv family frequency 'and probability of 
knowing^ ward>esembles the curve presented in Figure 1. In 
terms .0? brpadth of knowledge, we would expecf a ceiling at the 
upper-end of the frequency scale: mostpeople knpwall of the very 
common words. Other aspects- of the curve would" diffei^entiate 
individuals: The point at whTch the curve dropped fToni| the 
plateauMevel, and the slope of the function probably are theftwo 
parameters that would capture the important individual 
differences. Even for children, we might best think of the 'curve 
leveling out as the words become very infrequent, since it is likely 

' that, frdm their hobbies, interests, or the occupation of their 
parents,, most children would know some very rare words. 
Nevertheless, we have "drawn the lower portion of the curve as a 
broken line since we are less sure about the relationship in this 
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In summary, a good test of word knowledge would , 
present the subject with ja large number of words, sampled 
liberally from, the whole I'ange of word frequency. Techniques 
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Figure K 

Possible relationship between likelihood word meanings are known 
c and frequency of usage. - ^ 
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FAMILY FREQUENCY (ON AJ.OG SCALE) 



should be developed which allow accurate estimation of the 
relationship of a given subject's probability of knowing a word 
and the frequency of the word's morphological family.. 

Criteria for Determining that a 
Word Is in a Person's Vocabulary 

Four sorts of test formats have been enjployed in attempts 
to assess breadth of vocabulary knowledge: a) multiple choice; b) 
constructed answer in which the subject attempts to give a. 
definition, a synonym, an illustration, or use the word in a 
sentence or phrase; c) yes/ no judgments, in whiCh the subject 
checks the words in a list that he or she knows; and d) matching 
where the subject, pairs off words w^th their synonyms. Sims 
(1929) compared these four types Jibing data obtained from 
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students in fifth through the eighth grades. The correlation 
matrix Sims reported is reproduced in Table 4, Sims concluded 
that, although the checking method was as reliable as the others 
it did not seem to offer acceptable "construct validity. Onlyr 
seventy words were used, however, and Sims failed to counter- , 
balance for order or delay between tests. While there may be 
some questions about the trustworthiness of Sims' results, there 
is intuitive sense in the notion that the constructed answers, 
multiple choice, and matching tasks have more in common with 
one another than they have with a checking task that is not 
corrected for guessing, 

' Table 4 - "^s^ 

Correlations between Four Types of Vocabulary Tests 



1 



V Checking (yes no) .92* ' 

2. Multiple choice -54 .84* 

3. Matching .64 ' .85 .93* 

4. Constructed answer .56 /74 ^82 

.Vf^^f. From Slms'( 1929). 
*SpIil-hiiir re 1 ill hi 1 it y cocrtlclents. 

The question that needs examination is which of these 
methods will be of most theoretical and practical value as a 
measure of vocabulary. Three of these types will be discussed in 
the light of several issues. Since the points raised about the 
multiple choice format apply even more cogently to matching, 
the latter will not be dealt with separately. 

Multiple choice methods. People often possess partial 
knowledge of words. In these instances the items' distractors 
become crucial. An individual may select the correct synonym for 
platitude from the choices: a) duck-billed mammal, b) praise, c) 
commonplace remark, d) flatness. He may make the correct 
selection because he has heard the word used in reference to an 
utterance and with a negative connotation. This information, 
however, may not enable him to select correctly from a) 
commonplace remark, b) nonsense, c) irrelevant question, d) 
insult. The set of choices .constrains the individual's response to 
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different degrees, and different policies for generating distractors 
will, of course, lead to differences in performance, 

Lepley '( 1955, 1965), for example, constructed two forms 
of a synonym test, one employing distractors from the same 
semantic, category as the target, and another which used 
distractors from semantically diverse categories. Lepley (1965) 
found ^qual split-half reliability (,93 and 94) but only a ,66 
correlation between performance on the two scales, and 
significantly superior performance on the version requiring only 
gross discriminations. The correlation is surprisingly low given 
the common formal and the fact that the superficial demand 
characteristics were the same. Lepley's results illustrate the 
influence of the disiractor set. 

The multiple choice format is currently the most widely 
used in standarized vocabulary testing (e.g., California Achieve- 
ment Tests, 1977; Metropolitan Achievement Tesfs, 1970; 
Stanford Achievement Tests, 1973). The principal complaint 
.raised here so far is that tlfe distractors cannot avoid constraining 
the subject's response. If the.purpose of the test is to provide data 
on relative performance .only, not on absolute level of 
performance, then the distractors can be, and usually are, chosen 
to maximize the discriminating power of the item. If one is 
interested in vocabular> . t'c, then this policy will not do. 

Many vocabulap > ^^^(e.g., Stanford, 1973) use sentence 
completion in a multiple ciioice formal. Many of the problems 
already mentioned apply eve'n when the lest simulates a real 
encounter with the target word, J n addition, the question of the 
effects of various amounts of contextual support on estimated 
vocabulary si/c, with groups of words that vary in frequency of 
usage, has not been studied. There is research that suggests that 
individuals vary not only ifi.the si/e of their reading vocabularies 
but also in their ability to use context to deduce the meanings of 
unknown and partly known words (Mason, Knisely, & Kendall, 
1978; Pearson & Studt, 1975). ^ 

A tricky problem with the multiple choice format is that 
young children may not consider all the1iistractors(Asher, 1978; 
Brown, 1 975; Vurpillot, 1 968). They will often choose the first or 
second alternative if it makes reasonable enough sense. The test- 
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taking strategies of older children on multiple choice tests are no* 
yet well characterized, but there quite probably are strategic 
components of good performance which, serve to increase 
spuriously the relationship between a multiple choice vocabuMary 
test and other achievement or intelligence tests in the same 
format. An insidious possibility is that some of the apparent 
growth in vocabulary knowledge over the elementary school 
years is really attributable to the acquisition of more sophisti- 
cated test-taking skills. * 

In conclusion, the multiple choice format is the most 
popular^ one. It makes relatively efficient use of examinee time 
and must be reasonably valid, otherwise the strong relationships 
between performance on such tests and other measures of 
linguistic competence, summarized at the beginning of. this 
paper, would not have been obtained. The chief complication 
with the multiple choice format, when one wants absolute 
measures.of vocabulary knowledge, is how to choose distractors. 
A further problena is that multiple choice tests may make 
demands, on strategic knowledge in which young and poor^ 
readers/are deficient. 

• '^Constructed answer measures. To overcome the problem 
of selecting distractors, several researchers, notably Seashore 
(1933), Smith (1941), Terman and Merrill (19.37), have used a 
constructed answer format in which the subject reads or hears the 
target word and then writes or tells a definition ofit, lises it in a 
sentence, gives a synonym for it, of in some other way provides an 
indication of its sense and reference. Subjects can be encouraged 
to do any one of these .things just so long as the experimenter is 
convinced the word is "known." This format is capable of dealing 
with a variety of levels of knowing a word and avoids the issue of 
distractors. There are, however, two substantial problems with 
constructed answer measures: The problem of scoring the 
answers and the problem of response bias. 

In the. written format, in particular, a constructed answer 
measure is confounded by factors- such as spelling ability, 
sentence construction ability; and even the ability to wrjte legibly, 
all of which m.ay discourage a subject from elaborating on a word 
used or understood in conversation. A slightly . more subtle 
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problem, and one that is more difficult to control, resides in the 
fact that, if a liberal criterion is used and the subject is allowed a 
range of possible responses to a target word, then a particular 
strategy for responding may be adopted. The problem is that 
some words would be more easily explicated in a particular form. 
The ..word noun may be more easily exfrlained "through 
illustration than by definition, for instance. The research of 
Anglin ( 1 970) and Wolman and Baker ( 1 965) indicates that, up to 
the age of about 10-12 years, children tend to provide concrete 
definitions-by-illuslraiion rather than by an inclusive term or 
synonym. It is entirely possible that, depending on scoring 
criteria, the preference at a different age for certain explanatory 
strategies could produce spurious estimates of the rate of 

vocabulary growth. 

A really vexing problem is how liberally to score answers. 
How does one score synonyms in relation to apt illustrations or 
perfect usage in a sentence? In many instances, partial knowledge- 
is displayed, in one of our own recent testing sessions, it became 
clear that many fifth grade .students had partial knowledge of the 
word forhici. Several students knew that it had something jip do 
W'ith not being permitted to do something but did not have as part 
of their knowledge the fact ihai for hid is used in imperative 
speech acts. Wc soon realized that, in this case, we needed to ask 
for its use-in a sentence. We have found other more subtle and 
difficult cases of partial knowledge. For the word propelled. 
there was no problem in the students' recognition of the word 
because of their knowledge of propeller. When probed about the 
function of a propeller, many came close to generating the notion 
of propulsion on the theory that it would be strange to have a big 
round blade going around on the front of a plane unless it served 
some fairly fundamental purpose- and what planes do is move. 

Sonie words have no near-synonyms. There are other 
instances when the only synonym is a less frequent word than the 
target. In such cases, the subject is being asked to produce a rare 
word in order to show that a common word is known. 

There are some almost irresistible tendencies displayed by 
an examiner when administering a test with a constructed answer 
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format. After a few children have been tested, the examiner 
develops a sense.of which words are easy and,'which are difficult. 
It requires conscious effort to avoid expecting more explanation 
of thd-difficult . words and less for the easy words. If every subject 
has known chair and the current subject pats the seat of his stool 
as a response, then the tendency is to award full marks. If he pats 

' the wall for edifice, however, he might not/score so well Similarly 
there is an urge to expect more elaborated responses from older 
subjects. The preschooler who tells you^{hat an auiomohile"" goes 
brrrrrmmm" will strike you more fa^vorably than the college 
sophomore who gives you the same^^ answer In addition, the 
experimenter will witness explanatipns of words which entail 
subtle nonverbal as well as verbal cu^es. Young children typically 
employ hand movements, facial expressions, and gesture in their 
communications especially when dealing with words that are a 
little difficult for them. / 

The horns of the dilemma are these. Stringent, opera- 
tional, adult-like standards for ^evaluating whether a response 
indicates a word is known will confound what is supposed to be a 
measure of breadth of vocabulary knowledge with expository 
ability. Looser, more flexible standards will confound the 
measure with the subjective judgment of the examiners which 
may change from word to word, subject to subject, and occasion 
to occasion. ; ' 

■ So the liabilities of/the constructed answer method are 
both logistical and substantial. It is inefficient per unit of testing 

^ and scoring time, and it seems to rely on often subtle intuitions on 
the part of the examiner, especially when* the subject displays 
partial knowledge of an item. 

YesI no format. The final format to be considered is that 
of "checking,'' which we prefer to term a yes/no method. In this 
format the subject simply indicates whether or not the^rpeaning of 
a word is known. Two of the major difficulties that have arisen 
consistently in the discussion of the other two major formats arc 
the problem of response bias, and the need to present the subject 
with a large number of words chosen from a wide frequency 
range. The checking format can satisfy the second criterion 
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admirably but problems of validity arise. Sims (1929) concluded: 

The writer is inclined to believe that a good guess as to whether or not a 
child knows the meaning of a word is almost as satisfactory a method of 
determining vocabulary as checking tests. The relative simplicity of* 
such a measure, the ease of preparation and administration should not 
blind one to its invalidity, (p. 96) 

Chall and Dale (1950) reported that the average tendency to 
overestimate word knowledge in the yes/ no format over and 
above the definition format amounted to about 1 1 percent, and 
was more pronounced for rare words. 

It ought to be no real .surprise that a yes/ no test uncor- 
rected for decisions in the face of partial knowledge would give 
inflated estimates of vocabulary size and would correlate poorly 
with other measures. Consider the yes/ no task from the point of 
view of the test taker. Some individuals may deny that they know 
the word \^old because they do not know its atomic weight, while 
others will agree they know it because they havr m feeling that it 
can be used to refer to a color. 

The problem of correcting yes / no test .scores for guessing 
is not insuperable. Statin'i the issue more precisely, guessing is 
only part of the problem. The real issue, as the i^old example 
illustrates, is one of eliminaiing variation in the degree of confi- 
dence different individuals inust have before they arc willing to 
say, "Yes, I know, that word." 

Signal detection theory (Swets, 1 964) affords a conceptual 
and computational framework that may allow estimation of 
amount of word knowledge independent of judgmental stan- 
dards. This theory was originally developed for use in psycho- 
physical experimentation. In this setting, typically the subject is 
informed that he will hear a short burst of background noise and 
that there may be a tone sounded as well. The subject's task is to 
report whether or not a tone (the signal) was present. Research 
ha^ established that it is possible to get a very accurate estimate of 
a persons capacity to detect the signal by correcting for whatever 
tendency he or she ha.^ to report "hearing" the signal when it is not 
actually there. Pastore and Scheir^er (1974) have summarized 
research showing that this paradigm can be applied to the 
analysis of a broad range of perceptual and cognitive tasks. With 
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respect to vocabulary assessment, the work of Zimmerman, 
Broder, Shaughnessy, and Underwood ( 1 977) has suggested that, 
by usin.? close-to-English nonsense letter strings as the *'noise 
only" stimuli; signal detection methods might be applied to 
word knowledge. 

We are currently analyzing data collected from elemen- 
tary and high school subjects on large numbers of words. The 
students responded yes or no to a mixture of many English words 
and almost as many nonsense words. Later they completed 
standardized multiple choice questions on the real words. Our 
preliminary analyses have indicated that yes/ no scores adjusted 
according to signal detection theory, and other corrections for 
guessing and risk-taking, correlate highly with multiple choice 
performance. We later interviewed the subjects individually 
about a subset of the words. The data suggest that a value derived 
from the yes/ no task gives a better estimate of true word knowl- 
edge than performance on the standardized multiple choice test. 

The fact that words have multiple meanings poses a prob- 
lem for the yes/ no task, since presumably a person will check 
**yes'' if he or she knows any meaning of a word. This is not a 
small problem. According to Lovell (1941), 43 percent of the 
words used by Seashore and Eckerson ( 1 940) had multiple mean- 
ings. Recently, Balch (cited in Johnson & Pearson, 1978, p. 17) 
has reported that from 23 percent to 42 percent of the words in six 
widely used basic vocabulary lists have multiple meanings. In 
other recent research. Mason, Knisely, and Kendall (1978) have 
sh(5\vn that children are much less likely to know the secondary 
than the primary meaning of words used in their secondary sense 
in a popular basal series: It is apparent that the yes/ no format is 
riot suitable for distinguishing which of the meanings of a word 
are known. When that is the goal, some other method of assess- 
ment is required. 

In summary, the great attraction of the yes/ no format is 
that it permits the presentation of a very large number of words 
in a given interval of examinee time. Compared to the multiple 
choice format, it reduces somewhat the burden of preparing 
distractors and, compared to constructed answer formats, it side- 
steps vagaries of scoring. The notable problem with the yes/ no 



Vocabulary Knowledge 



117 



109 



task is that scores of individuals will be influenced markedly by 
differences in tendency to take risks in the face of uncertainty. If 
this problem can be solved, the yes/ no task might be very useful 
for assessment of breadth of word knowledge. 



Conclusion 

While current research demonstrates the importance of 
such factors as a reader's perspective on a text (Pichert Ander- 
son, 1977) and text stfucture(Mandler& Johnson, 1977; Meyer, 
1975), it is also clear that word knowledge is a requisite for read- 
ing comprehension: people who do not know the meanings of 
very many words are moKt probably poor readers. There are 
serious gaps in our understanding of why this is true and of how 
word knowledge grows throughout the life span. Filling those 
gaps promises to be both an intellectual and-a practical challenge 
of considerable importance. We judge that a criticaffirst step is 
the development of improved methods of assessing breadth of 
vocabulary knowledge. It is only after some refinement has been 
achieved at this level thai models of lexical development and in^\, 
structional programs can be based on realistic expectations 
about the acquisition of word meanings. 

We. conclude our review of vocabulary knowledge and 
vocabulary size with the realization that, since the turn of the 
century, a tremendous amount of energy has been put into 
answering the question, "How many words does an individual 
know?" We have' come to wonder if this question js properly 
framed. The nature of language may make it unanswerable and 
thus, for scientific.purposes, irrelevant. Empirical methods may 
be able to generate useful indices such as that discussed earlier— 
the relationship of the individual's knowledge of words to word 
frequency. H owever, to produce a single value from performance 
on a sample to represent total vocabulary size may be an exercise 
that relies too heavily on the assumptions of a static population 
of isolated words and' on an overly restrictive view of how we 
generate and use words in context. 
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Social Context of Learning to Read, 
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To adults, reading is a solitary activity, a kind of internal lan- 
guage process that contrasts with interpersonal talk. The contrast 
is not complete: We read song sheets aloud and together; we 
exchange notes during a lecture, thus using reading as well as 
writing for immediate interactional ends; and we listen alone to 
talk on the radio or TV, thus making a solitary activity 9f the 
comprehension of speech. But usually we talk with others and 
read alone. 

Not so with children, especially children just learning to 
read in the primary ^rades.JLearning to read, like mature reading 
later on, is certainly a cognitive process;>ut it is also a very social 
activity, deeply embedded in interactions with teacher and peers. 
Hopefully, as we understand those interactions more fully, we 
will be able to design more effective environments for helping 
children learn. This paper reviews research on children learning 
to read in classroom interactions in four parts: influences on time 
engaged in reading; differences in the focus of instruction; the 
complexities of reading group lessons; and peer interactions in 
. the older grades. 

Influences on Time Engaged in Reading 

One obvious way in which classroom interactions affect 
learning is through their effect on how much tinie children 
actually spend engaged in reading tasks. Three descriptions by 
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Piestrup (1973), McDermott (1978), and Au (in press, in prep- 
aration), are sociolinguistic and ethnographic analyses ot audio- 
or videotaped lessons. Two descriptions by Hess and Takanishi 
(1974) and Cazden (1973) are more traditional studies in which 
observers did on-the-spot coding. 

Piestrup's research (1973) is on sources of interference 
between the language of black children and their teachers. In an 
analysis of 104 reading instruction episodes audiotapedin 14 first 
grade classrooms with predominantly black children, Piestrup 
identified two kinds of interference which she labelled structural 
and functional. Whether the mismatch is only a temporary 
misunderstanding or a more serious barrier depends on the 
teacher's understanding of the problem and her response to it. In 
the following episode about a workbopk lesson, the teacher 
explicitly and effectively dealt with a structural (dialect) 



conflict: 



T **. . . how would you harm the coit?" 
Ci Tear it. 
T Huh? 
Ci Tear it. 

T Th— ih— Oh! Do you, do you know what a colt is, now? 
Ci Oh, kill it, kill it! 
T No, whales a colt? 
Ci Somethin' you wear. 

T There's an in it, "Coat" is c-o-a-th— don't laugh, that's all 
right. "Colt" is very hard for city children, because they haven't 
been out on the farm, and they don't know about it. It's a baby, a 
babycolt. 

Ci A baby^colt. 

Ci Oh' yeah !\ 

T Remember \he story? An* it's a c-o-l-t. "Coat" is c-o-a-t, and 
there's no "1" ifi it, but listen to— Keisha— co/t, co/t, co/t. Now do 
you know what a colt is? 



This is an expanded version of a paper by the same title which appears in L.B. Resnick & 
P.A. Weaver (Eds.), Theory and Practice of Early Reading. Reprinted by permission ol 
Lawrence Erlbaum Associates, Inc. / 
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Ca Ycalu I know. 
T What is it? 
Ci A baby horse. 

T Yes, uh-huh, how could you harm a baby horse? 

(Piestriip. 1973, pp. 3-5) 

Interference is termed functional rather th^n structural 
when the mismatch comes from the functions language is used for 
rather than from structural features of the language itself. In the 
following excerpt from oral reading, the children shift away from 
discussion of remote content to verbal play; the teacher is.ignored 
and fails to get their attention back to the reading task: 

Cj "Off to ihe-^ 

T OK. It says **wood/* 

Ci wood. 

T Wc would s.ay woods- this book was written in England, 

Ci Now. Pm through. I ain't |j;*)nna read this page again. 

T OK. Well, wc*re gonna turn the page and weVejust gonna read the 

next page. 

Ci Uh uh! Darren 'sposcd to be first. 

T WclK Pm waiting for Darren to come back. Come on. Darren. 

C: He just playin* aroun' ^ 

(not elcar). 

Ci He crack his knuckles, in the buckles, 
Cx IJh-uh. 

T OK. Zip "and Wendy ran to^the woods, and here's the - 
Ci I got a tow truck. My mama bought me one. 
T father. 

Ci An* I got me a car to hook it on. It got a hook - 

' (Piestrup, 1973. pp. 6-7) 

The two teachers out of the group of fourteen who were 
able to accommodate most effectively to both structural and 
functional sources of interference, termed "Black Artful" by 
Piestrup. had teaching episodes that were both lively and focused 
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on reading, and their children had the highest reading scores at 
the end of first grade. Piestrup concludes that "the ways teachers 
communicate in the classroom are crucial to children's success in 
learning to read" (p. 170). 

McDermott (1978) has done an intensive microanalysis, 
frame by frame, of videotapes of two 30-minute reading groups 
(top group and bottom group) in one first grade classroom. 
During those 30-minutes around the reading table, children in 
the top group spend three times as much time on task as children 
in the bottom group, and McDermott has tried to understand 
how this happens. First, the procedure for allocating turns to 
read is different in the two groups. In the top group, the number 
of pages in the story is allocated equally among the children, and 
each child reads his share in order around the table. In the bottom 
group, there's no fixed' order and each turn is negotiated 
according to who requests a turn and who the teacher thinks can 
read the page in question. Interruptions are more frequent in the 
bottom group (40 vs. 2 for the top group) and more disruptive 
because continuation of reading is more dependent on the 
teacher for assigning the next turn:. Some of these interruptions 
are even initiated by the teacher herself: 

On one occasion, for example, .*;he organizes the ehildren lo eall for a 
turn to read their new books, "Raise your hands if you can read page 4." 
The ehildren straighten themselves up in their ehalrs, form neat lines 
along the sides of the reading table, and either raise their hands for a 
turn or at least look at their books or the teaeher. As their hands reach 
their highest point, the teacher looks away from the reading group to 
the back of the room. She yells at one child in the top group, and then 
another child in the top group. The three children in the bottom group 
who rai.sed their hands, lower them to the tablcrAnothcr little boy who 
didn t have his hand raised thrusts his chair back away from the reading 
table and the teacher and balances it on its two back legs. The other two 
children in the group simply look down at their books. The teacher 
returns and says. -'Nobody can read page 4? Why not?" Eventually the 
ehildren re,cover. and someone gets a turn. But it all takes time. 

(McDermott, 1978) 

' How does this contrast come about? Possibly the teacher 
has been told somewhere that calling on children in a random 
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order helps keep the attention of potentially more disorderly, 
children. More importantly, McDcrmott (1978) suggests: 

What is driving ihis whole system? 1 don't mink it is the negative expec- 
tations ol" the teacher. Rather, the children in the bottom group repre- 
sent pedagogical and interactional problems lor the teacher. I'cda- 
gogically. there is no doubt that it is easier lor the teaeher to practice 
reading with the children in the top group than to struggle with the 
process of teaching decodin^^to the children in the bottom group. And 
intcractionally. there i.s the pressure of the eonipetition between the 
groups and the searred identities of the children in the bottom group. 
Even within the bottom group we hear claims iif one child against 
another. ("Oh. you can't read." "Better than you.") Or we can point to a 
child in the bottom group who constantly calls for turns to read while, at 
the same time, appears to struggle to make sure that she does not get eye 
eontact with the tea'.'her. 

In response to all these problems, the teaeher and the children in the 
bottom group make adaptations. In response to all these pressures they 
struggle to solve the pedagogical and interactional problems of coming 
to school not knowing how to read, of having a teachcf who expects 
them to know how to read, of having a teacher who doesn't know how 
to overcome that they do not know how to read while she has twenty 
other children walking around the room, and of overcoming the 
pressure of having the other children taunt them for their performances. 
In response to all this, they make very specific adaptations. One adapta- 
tion is to make sure that no one child is isolated to read something too 
diffieult. So the teacher uses the two different turn taking systems with 
the different groups, and this adaptation has the consequences already 
e.xplieated. 



McDcrmott (1978) concludes: 

Success in learning is best predicted by the time a child spends on a task: 
some may learn faster than others, but with time, almost any child can 
learn what has to be learned in school, if there are the proper organiza- 
tional constraints for getting the child on task forthenecessary amount 
of time. The question of why some children achieve more than others 
■ has been transformed into a question about the environments in terms 
of which some children get consistently organized to attend to school 
tasks in classrooms while others do not — 
• ..Certain children, vyho. for whatever reasons come to school behind 
their peers in the development of classroom skills, constitute both 
' pedagogical and interactional problems for most teachers. Most 
teachers say of them that they are harder to teach: part of that reaction is 
that they need more of the teacher's tim^e if they are to catch up with 
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ihcir peers. In adclllion, ihcy must learn under ihc pressure ofknowing 
thai ihey arc behind, generally in a classroom which allocates status in 
part on the basis of the children's intellectual ranking in the class- 
room. ... 

Thus, the small differences between children in the early years of school 
expand quickly to the drastic formfi of differential performance which 
become obvious in later years. At the root of these differences is not so 
much the extreme complexity of the .school tasks, nor the differences in 
the learning potentials of the different children., but 'the differential 
• environments we offer the children for getting organized and on task so 
that learning can take place. ., 

I- think we have to acknowledge that what McDermott has 
exposed would be found elsewhere if we dared to take as close a 
look. 

Fortunately, we have reports of one success story too. The 
Kamehameha Early Education Brogram (kckp) is in a privately 
supported school for ethnically Hawaiian children, whose read- 
ing achievement in regular classrooms traditionally has b^en very 
low. In 1976 a new reading program was introduced at K|:E:p,and 
reading scores of the first grade children increased from an 
average of the 1 9th percentile, in the preceding three years to the 
69th percentile (Jordan, Weisner, Tharp, & Au, 1978). 
- ' According to Au (in press), the new reading lessons have 
three component parts, which she labels ktr, for Experience, 
Text and /Relationships: The teacher begins by evoking com- 
ments from the children about their experiences that relate to the 
story (which is usually from a basal reader); she then as<;igns a 
page or two of text to be read silently and questions the children 
about the text; finally, she draws out relationships between the 
text and their experiences. 

So far, except for the careful attention to evoking the 
children's personal experiences to engage their attention and 
provide schemata for comprehension, this sequence does not 
sound different enouglr to account for such striking gains. Au 
believes that the success lies not only in the cultural congruence of 
the content but, as with Piestrup s Black Artful teachers, in the 
cultural congruence of the context as well. Briefly, the rapid 
interactions between teacher and children, and the cooperative 
interaction among the children who build on one another's 
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responses, produce lesson talk with striking similarities to "talk- 
story," a form of collaborative narrative of personal experience 
that is a special speech event in Hawaiian culture. Au (in prepara- 
tion) is now documenting in more detail, from an analysis of 
videotaped lessons, how this talk happens and how the teacher 
channels the talk-story-like ways of speaking toward academic 
ends. Quantitatively, the children are certainly more engaged in 
reading tasks in the new program; but qualitatively the focui of 
their attention has been changed as well. \ 

Hess and Takanishi (1974) observed student "engage>t 
ment" in eight 30-minute observations in 39 elementary schoolN 
classrooms in low-incorne communities to find out what teachers 
did to "turn on" their students to academic work in mathematics 
and language arts. Overall, they found that student engagement 
was strongly and consistently related to teacher behavior, but not 
to classroom architecture, nor to student characteristics such as 
sex and ethnicity. Two demonstrations of intra-teacher consist- 
ency in their data are impressive. First, two teachers were 
observed during two consecutive years. Although they had 
completely different classes and reported that they felt large 
differences between the two years, the mean level of engagement 
in their classes remained almost identical. Second, during the 
s^ccond year of the study, an entire school being observed moved 
from a self-contained classroom building to one with open-space 
architecture. The overall level of engagement across these very 
different physical environment^ was identical (82 and 83 
percent), and the rank order of teachers in terms of percent 
engagement in their classrooms was .85. 

Contrary to expectations, Hess and Takanishi found that 
these levels of student engagement were not consistently related, 
across teachers to "specific teacher strategies" such as the 
frequency of specific questions or of feedback; instead they were 
strongly related to more "global instructional strategies" such as 
instructional group size (more engagement in small groups), 
and direction of student attention (more engagement when 
directed toward the teacher than toward other students or 
materials alone). The authors conclude with a recommendation 
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that teacher-training programs concentrate on skills in classroom 
social organization rather than on >more specific teaching 
behaviors. This is an important caution for competency-based 
training as it is usually conducted. 

A n observational study, done at the request of Children s 
Television Workshop, also measured children's engagement— or 
attention as we called it (Cazden, 1^73). We wanted to find out 
what environmental variables affected the viewing behavior of 
children watching The 'Electric Company in their elementary 
school classrooms. Viewing behaviorwas defined as both visual 
attention and verbalizations. We observed ten primary grade 
classrooms during the 30-minute show five or six times each. Two 
independent measures of attention were used: a scanof the entire 
class at 30-second intervals to count those visually oriented 
towardthe TV screens, and continuous monitoringand recording 
of the visual attention of individual students on an event- 
recorder. Monitoring individual attention on the event-recorder 
jwas extremely reliable (.94 interobserver agreement), and group 
attention averages from the 30-second scans had high validity 
(average within classroom correlation of .94 between measures of 
group and individual attention). Coding verbalizations was more 
difficult (interobserver reliability attained only .84). The ten 
classrooms were selected to represent a range in classroom 
"structure," defined here as a continuum from classrooms where 
attention to the show, was expected and enforced by the teacher 
("high" structure) to classrooms where a range of competing 
activities was available ("low" structure). As expected, we found 
that classroom structure was positively related to both group 
attention (correlation .87) and individual attention (correlation 
.95). High structure affected all children, increasing their 
attentiveness and responsiveness to The Electric Company, so 
that poor readers in high structure classrooms had higher 
attention scores than better readers in low structure classrooms. 

With the exception of one classroom, structure also 
correlated highly with average number of reading responses 
(correlation of .90 for nine classes, but only .38 for all ten). In the 
one exception, the most highly structured classroom was highest 
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in amount of attention paid by the students but lowest in average 
number of reading responses. Since there was nothing in the level 
of reading ability in the classroom that would explain this 
anomaly, we think that some aspect of this teacher's classroom 
controf (which we could not understand from our limited 
observations) discouraged overt reading responses. 

Because The Electric Company is designed esficcially Tor 
children reading below grade level, we were also interested in the 
relationship between viewing behavior and reading level. 
Children's reading ability can be categorized according to their 
relative standing in their class (high, middle or low reading 
group) or ranked more absolutely according to standardized test 
scores. Average attention of children in the lowest quartile of 
achievement test scores was 79. 1 percentile. While this was not as 
high as the 86.5 percentile and 90.2 percentile attention of the two 
middle quartiles (25-75 percentiles), it was higher than the 65.8 
percentile attention of the best readers and was encouraging 
evidence that the show was reaching its intended audience. More 
interesting and surprising was a finding that, without exception, 
children of the same tested reading level showed less attention 
and more fluctuations in their attention (more distractions) when 
they are among the lowest readers in their classroom than when 
they are in relatively higher reading groups. The data are shown 
below for the six second grade clas:::coms for which fall 
standardized test scores were available. 

Aiicniion ;nul l liiciuiiiion of Children in lliyh ami I.ow Reading (ironps 




C lass Siontlinsi 



1-25 



26-50 



' 51-76 



76-100 



Percent attention 



I.ow 



High 



0. (0) 
79.U20) 



89.5(10) 90.9(9) 
79.0(4) «7..'^(2) 

Nunihcr Iluctuations 



7.^2(11) 
49.4(5) 



Hijzh 0. (0) 51.2(10) 3(i6(9) 

Low /. 50.6(20) 58.6(4) 57.5(2) 



44.4(1 1) 
64.6(5) 
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Because our sample was not designed for matching 
numbers of children in each of these cells, firm conclusions 
cannot be drawn. But in these admittedly limited data, lower 
relative standing in class (in terms of reading group assignments) 
adversely affects children's attention to televised reading 
material. Seen in this way, a variable such as reading level 
(usually considered a child variable in its absolute sense) becomes 
an environmental variable as well through the child's relative 
standing in the classroom group. 

Differences in the Focus of Instruction 

Time on task is a powerful variable, but it is not the only 
one. One more qualitative variable is where the attention of 
children and teacher is focused during reading instruction. We 
know that learning to read requires mastering a complex set of 
concepts and skills at many levels of a hierarchical system. 
Analytically, we can separate a series of nested units— from the 
meaningless sounds symbolized by letters, to larger and larger 
meaningful units of words, phrases, clauses, paragraphs and 
stories; and we can isolate the conventions of punctuation, 
capitalization and layout on a page that support the communi- 
cation of meaning (remember that the division of print into lines 
Is one visual feature that does not carry meaning except in poetry 
and that children must therefore learn to ignore). But such 
analytical separation says nothing about how children should be 
helped; it does not determine in what order their attention should 
be focused on different units in the hierarchy, nor how an 
eventual integration can best be achieved. 

The simplest contrast in focus is between decoding skills 
and meaning. We know wb cannot tell what actually happens 
from the manuals on a teacher's desk or the methods she 
professes to use. For example, in one of the first grade reading 
studies supported by the Office of Education Coojierative 
Research Project, Chall and Feldman (1966) went behind 
"methodjA vs. method B" comparisons to e;camine what teachers 
actually did to implement those methods. Observational studies 
of teachers showed no significant relationship between the 
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ranking of the teacher's professed method emphasis (whether 
"sound-symboP' or "meaning") and the method emphasis 
observed in her classroom (p. 573). 

If attention to phonic skills and to meaning is included in 
reading group lessons, then that combination can create 
problems of shifting focus and complex interactions that will be 
discussed further below. Here I want to report research that 
describes classrooms in which these foci are separated— by 
children, by type of instructional event, or by language. 

Separation of Focus by Children 

In the classroom studied by McDermott, the focus 
differed from one group in the classroom to another. 

[In the lop group] occasionally, the children create problems by word 
calling instead of reading for meaning, and the teacher's main 
pedagogical task is to convince the children that there is living language 
complete with propositions with illocutionary force on the page. Thus, 
one child reads. "But Ricky saiti his mother. , ." in a dull monotone, and 
the teacher corrects her. "Let's read it this way. 'But Ricky, said his 
mother'.'* 

With the bottom group, the teacher has rather different problems. 
Accordingly, the teacher and the children constitute rather different 
environments for each other in the different groups. The children in the 
bottom group do not read as well as the children in the top group, and 
the teacher attends less to the language on the book's pages and more to 
the phonics skills needed to interpret any given word in the text. Thus, 
there are many more stopping places in the children's reading, and the 
story line which is to hold the lesson together is seldom alluded to and 
never developed. (McDermott, I97K) 

This same contrast between focus on meaning for better 
readers and focus on phonic skills for poorer readers is found in 
two other studies by Gumperz (1972) and Allington (1978). 
Gumperz reports observations in a first grade classroom in a 
racially integrated California district: 

We observed a reading session with a slow reading group of three 

children, and seven fast readers With the slow readers she [the, 

teacher] concentrated on the alphabet, on the spelling of individual . 
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" words., ^ She addressed the children in what white listeners y^^i^. 
identify as pedagogical style. Her enunciation was deliberate and 5 ^ . 
Each word was clearly articulated with even stress and P^'cn V if 
Pronunciation errors were corrected whenever they occurred; ev^^^d 
the reading task had ro be interrupted. The children seemed distr3 
and inattentive 

With the advanced group on the other hand reading became niuch i^^rc 
of a group activity and the atmosphere was more relaxed. Words ^ of 
. treated in context, as a part of a story , , ; , There was no correctii?^|ic 
pronunciation/ although some deviant forms were also heard, .rie 
children actually enjoyed competing with each other m reading an^^ of 
teacher responded by dropping her pedagogical monotone in fav 
more animated natural speech. (Gumperz, 1972) 
Allington's study (1978) suggests that this contras^j^At 
focus is not just a chance characteristic of two class ro oHis ^^^r^ 
happened to be studied by McDermott and Gumperz. AU^n^^st 
audiotaped oral reading segments of reading lessons with the 
and the poorest readers in 20 primary classrooms in three scl^^^^^ 
districts/He analyzed how the teachers responded to cbiWr 
oral reading errors and found dramatic differences between 
■ two reading groups across the 20 classrooms, differences wI^q, 
fit exactly the pictures described more qualitatively by ~^tie 
Dermott and Gumperz. First, there was a difference m 
rate of teacher corrections of the errors (68 percent of P^^^ 
readers' errors were corrected, but only 24 percent of ^ of 
readers' errors). Second, there were differences in the utnit^^^cr 
the correction: teachers were more likely to interrupt P vs. 
readers at the point of error (§8 percent ot poor readers^ error- 
only 70 percent of good readers' errors) rather than Wai^^i^^^te 
the next phrase or clause boundary. Finally, there ^^h^ 
differences in the cues , provided by the teachers lo belp^re 
children read the right word: for the poor readers, the cUes ^ I7 
more apt to be graphemic/ phonemic (26 percent vs. only^^d 
percent of the cues for good' readers), while the cues for ^ I4 
readers were more apt to be semantic/ syntactic (3 1 percent V?' 
percent for poor readers). ^per 
The critical question raised by these reports is ^h^^jay 
such differentiated teacher behayior is helpful or not. Mane jn§ 
speaks from New' Zealand of the goals of education as helF 
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children become "self-improving systems" (personal communi- 
cation). In other words, the goal. is not to create children who 
never make mistakes, but rather children who have the capacity 
to notice their own mistakes and have strategies for correcting 
them. She has found that children in the first grade who do the 
most self-correcting are the children who become the better 
readers in |^e grades (Clay, 1973), Does being interrupted make 
.self-correction more or less likely to develop? And if a cue from 
the' teacher is needed, what kind of a cue should it be? 

Allington's paper is titled, "Are good and poor readers 
taught differently? Is that why poor readers are poor readers?" 
The implication is clear that he believes it is possible (as do 1) that 
these teacher behaviors to low group children may increase their 
problems in the long run. Prompt interruptions seem too much 
like a "law and order" approach to errors, as if the teacher is 
acting out of fear that the errors, like the children themselves, 
may get oufof control. But just because it is the long run that 
counts, we need longitudinal studies that follow teacher 
behaviors and children's progress over time. (I am grateful to 
Rebecca Barr for this caution.) Only then can we separate 
constructive individualization from destructive bias. 

Separation of Focus by Instructional Event 

A very different kind of separation of foci of attention is 
by instructional events distributed throughout the school day. As 
part of a larger study „of children's functional language 
competence in kindergarten and the'primary grades conducted at 
the Center for Applied Linguistics, Griffin (1977) has described 
the se.t of events in which reading happens in one first grade 
classroom. These include: readin^^ a recipe for hot cross buns that 
leads to a discussion of the meaning of ^'lukewarm" and 
experiments with feeling lukewarm milk later in the day; and 
story time when the teacher reads aloud, stopping frequently for 
talk about what is going to happen next. Griffin notices that 
comprehension skills of vocabulary and prediction were built in 
such nonreading group times of the day; whereas in the reading 
groups themselves, phonics was the overriding concern. This 
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separation was so consistent that definite expectations about 
appropriate responses had been learned by the children. If the 
teacher shifted momentarily to a meaning cue during a reading 
group, the children were apt to respond with a phonic-based 
response anyway. For example: 

One child was reading, ina vco' halting style: "The pigeon nies far. . . r 
He paused. The teacher repeated the sentence in a more fluent style with 
correct intonation and then gave the child a prompt: "The pigeon files 
far.... Think. Think whai it would say! The pigeon fiies far...." A 
second child chimes in saying, " 'A* says it name. /I vvw." (Griffin, 1977, 
p. 38!) 

In considering the merits of such a separation by 
instructional events, we must remember that it can only work in 
classrooms where there is a rich set of nonreading group events in 
which reading takes place. One tragic result of the pressures of 
the back-to-basic movements may be less time available for 
experiences like reading recipes and hearing stories in which 
"vocabulary building and. comprehension education can so 
meaningfully occur. 

Separation of Focus by Language 

In the most extreme case, a focus on meaning and a focus 
on phonics may be separated in a single classroom not only by 
reading events, but by languages as well. While this review does 
not attempt to cover research on learning to read in two 
languages, my own observations in a bilingual first grade 
classroom in Chicago are relevant here.* In the fall, the teacher:s • 
reading instruction was in Spanish only,, using a traditional 
syllabic approach (ma me mi mo mu). Around Christmas, when 
she felt that the children's oral English had developed 
sufficiently, the teacher started a phonic-based reading program 
in English. As she described that attempt afterward, it just didn't 
work; the children resisted and she finally stopped. About that 
time, in a graduate class she was taking, she read Sylvia Ashton- 

*Thcse observations are part of a research project on "The social and^cultural 
organization of interaction in classrooms of bilingual children," supported by nie grant 
780099 to Frederick Erickson and Courtney B. Cazden. 
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Warner's Teacher and felt immediately that those ideas fit her 
philosophy and her children. The result was that by February, 
when 1 visited again, instruction in Spanish reading via syllables 
coexisted, for all children, with instruction in English reading via 
"key words." M oreover, the teacher was consistent in the cues she 
gave the children in the two contexts. In Spanish she focused 
their attention on the syllabic components on which they had 
had extensive practice: 

Fe li pe to ma u na fo to. 
In English, she helped with a meaning cue: when a child couldn't 
read butter on his key word card, sheasked, "What doyou put'on 
your toast?" 

At first thought, such separation may seem detrimental to 
learning. Intuitively, it seems harder for children to get decoding 
and meaning cues together in a single mental act if they are taught 
separately in different parts of the school day, or even in different 
languages. On the other hand, maybe a clear and consistent focus 
of attention is helpful, especially for beginning readers, as long as 
both are included somehow, for all the children. 

The Complexities of Reading Group Lessons 

Primary grade reading groups are complex interactional 
scenes— complex because of the triangular relationship between 
a reader, a text being read, and the participation of teacher and 
peers; because of the many levels of organization of the text that 
may move unpredictably in and out of the focus of the teacher's ^ 
instruction; and because oral reading serves simultaneously as 
practice for the child and a context for evaluation by the teacher. 
Two research reports, by Dickinson, Kozak, Nelson, and Epstein 
( 1 977) and Heap ( 1 978, 1 979) say more about these complexities. 

Dickinson et al. described differences in single vs. multiple 
foci, and attendant differences in time spent off-task, in a math 
lesson and a reading gvonp lesson with first grade children in a 
single K-l' classroom. In the math lesson, the children were 
individually manipulating attribute blocks into intersecting sets. 
There was a repeated and, therefore, predictable sequence of 
teacher directives about placement of the blocks, questions to the 



132 



Uci 



Cazden 



children about what they had done, and finally a concluding 
statement about what they had fo;?nd out. In successive 
sequences, the two parts of each Jircctivc ^e,g,, ''Place the blue 
blocks in this circle" and ''Place the yellow blocks in this circle") 
were spoken with decreasing intervening t me, and successive 
questions to the children elicited progressively more information. 
In th|e reading group, in contrast, there was more variation and 
less predictability in both the focus of attention and the 
interactional structure. The teacher asked individual children to 
take turns reading aloud, but talk about the book title, table of 
contents, page numbers, and capital vs. lower case letters was 
interspersed in seemingly unpatterned ways. 

There were so many other differences between the two 
groups that no firm conclusions can be drawn— differences in 
activity, group size, and whether the group included all children 
present or only a subset. While the reading group was smaller, it 
did not include all the children in the roomatthetimeandsowas 
more subject to interruptions and divided teacher attention, it 
would take more controlled research to determine how much the 
interactional simplification of the math group alone contributed 
to the greater on-task engagement. 

The possible instructional value of such interactional 
simplification is not a new idea. Some of the success of Distar 
may be due to this feature. Such simplification has also been 
advocated in a discussion of the design of Sesame Street (Gibbon, 
Palmer,' & Fowles, 1975). A familiar example of holding the 
instructional frame constant while varying .the content is the 
Sesame Street categorization game, "One of these things is not 
like the other." Gibbon et al. explain the reason for this design: 

Varying! the content while keeping the format constant promotes 
familiarity with format conventions that are potentially useful for 
instructional purposes. The format of any program segment functions 
as a kindVf "frame ' for the in.structional content, a complex of auditory 
and visual conventions that the child can master through repeated 
exposure. For example, the viewer can learn to expect that a particular 
format will usually deal with a particular category of stimulus (letter, 
word number, concept) and with a particular intellectual activity 
(memorizing, sorting or classifying, guessing, combining). A particular 
sequence of events or types of events will reliably occur: a particular 
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type of feedback lo the viewer's implicit or explicit responses will be 
delivered. Moreover, a viewers familiarity with a given format can help" 
him determine at what point in the presentation the important 
information will come, how much of it there will be, perhaps even 
whether it is likely to be too easy, too difficult, or about right for him. 
Among the main instructional advantages afforded by these various 
* forms of cueing is that' they will entice the viewing child to attend to 

what is new in each succeeding application of the format, since it will, 
•'stand out" against the familiar background more than if the entire 
presentation were novel. As a result, learning and concept formation 
are enhanced. (Gibbon, Palmer, 8l Fowles, 1975, pp. 225-226) 

Reading groups as traditionally enacted in primary school 
classrooms are inherently complex in content and interactional 
structure because learning to read requires so many different 
kinds of learnings. We need interactional analyses of alternative 
organizations of reading events in which these learnings are 
separated or combined. 

Heap (1978, 1979) is studying the "social organization of 
reading activities" in 20 classrooms. He has finished only one 
year of a five-year project, and so only preliminary reports are 
available. In these reports, he has identified three . '"social 
organizational problems" in primary grade reading instruction; 
two will be familiar to teachers, and all three raise important 
questions about the relationship between social organization and 
individual cognition. . . 

The two familiar problemis are problems in evaluating a 
child's response. On the one hand, a child's correct answer in a 
reading group lesson may be an artefact of other resources that 
the group provides. As Hea^.describes a specific example, "As a 
task organized to make reading skills observable and evaluable, 
the reading lesson provided an unforeseen resource, reading' 
aloud, for a participant to continue to participate in the task" 
(Heap,, 1979, p. 4) by answering comprehension questions even 
though her book had been closed. On the other hand, a child's 
reading errors may be due to obstacles created by that same 
reading group organization— for example, anxiety about the 
social performance of reading aloud in front of peers. 

The third organizational problem is more complex. Here 
is Heap's example, from the comprehension section of a second 
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grade reading group lesson after the first part of "Rumpelstiltskin" 
had been read: ' 

Teacher No, Who helped Mineen? 
Child Rumpelstiltskin, 

Teacher Yeah the little man. We don t know his name is Rumpel- 
stiltskin yet do we? The little man. Okay, what was the first 
thing the prince said— sorry, that the girl gave to Rumpel- 
stiltskin; to the little man! We better call him the little man 
because we don^t know really he s Rumpelstiltskin yet (Heap, 
1978, p. 2), 

The story was titled "Rumpelstiltskin"; the teacher had 
written that name on the board as a new vocabulary word at the 
beginning of the lesson; and she knew that several of the children 
had seen a movie version of the story the previous year. Yet she 
still corrected the child, and self-corrected herself, from saying 
"Rumpelstiltskin" to the vaguer "little man." As Heap says, it is 
only true that "we don't know his name is Rumpelstiltskin yet" in 
a very special sense: within the limits of and the terms of a 
convention, a game, that disengages reading and answering 
questions about that reading from everything else the child 
knows, from everything outside the boundary of the text itself. 

These "organizational problems" thalHeap has described 
are not unique to reading groups. Any exercise of any cognitive 
process— for us, as for'children— takes place in some context: of 
particular task format, physical conditions, social organization, 
conventional rules, etc.; and characteristics of that context will 
contribute either supports or obstacles to the cognitive tasks 
performed within it. Cognition is always in some context; and it 
cant be taught or evaluated apart from some particular context 
either. (I am indebted to Michael Cole for many conversations on 
this point.) Because of the importance of reading groups as a 
context for 'both instruction and evaluation in the primary 
grades, we need to examine that context—and the "games" we 
play within it~-with particular care. Yet because reading groups 
are so traditional and so familiar, that examination is especially 
hard for participants to do for themselves. To complete the circle 
back to the first study by Hess and Takanishi reported at the 
beginning of this review, classroom teaching must be considered 
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as a complex orchestration of social life in which diverse 
individual cognitive processes can most effectively be developed, 
and we need to understand teaching from both the social and the 
cognitive points of view. 

Peer Group Interaction in the Older Grades 

This review, has focused on learning to read in the primary 
grades — partly because Qf my own interests and experiences as a 
teacher (Cazden, 1976) and researcher with younger children, 
partly because most of the recent sociolinguistic and ethno- 
graphic research has focused on the primary grades as the place 
where children are first inducted into the school "culture" and 

- where their academic "identities" are\first formed, I assume that 
classroom interactions are not less important for reading 

' instructions in the intermediate and secondary grades, but that 
the relevance takes different forms. One important form is the 
relationship between the instructional process and interactions 
among peers. 

Consider the implications of just one study: Labov's 
research on the relationships among the incidence of nonstan- 
dard Black English (be) dialect features, peer group mem^^rship, 
and reading failure (Labov & Robins, 1969; Labov, \n press). 
Labov and his colleagues analyzed the incidence of be features in 
the speech of black adolescents in fourth through tenth grade, 
identified the speakers as either central or peripheral members of 
peer group in the street culture^ by sociometric interviews and 
participant observation, and then correlated these data with 
performance on standardized reading tests. 

One linguistic indicator of BE dialect is saying /i^/ve for /i^/^ 
in the third singular present. Labov (in press) reports the. 
frequency of the standard form has, and then comments on the 
group differences: 

...club members used only 19 percent of the has form; the lames 
[isolates from the street culture] used 60 percent; and whit^ working- 
class adolescents 100 percent These [dialect] differences are slight: 

they are small differences in the probability of a rule being applied. They 
reflect patterns of communication and ideology, but In no way could 
they be conceived ofas the causes of differences in reading achievement. 
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Differencesin reading achievement existed. All 46 club members, 
core participants in the street culture, reached a virtual plateau in 
reading achievement at the fifth grade level, while the 32 lames 
continued to progress, one-tlyd of them at or above grade level. 
According to Labov, the dialect differences are not in themselves 
the cause of these reading problems; they are rather the indicators 
of group membership and of a value system that accompanies^ 
such membership that is in conflict with the school. 

Support for Labov's argument come^ from the co- 
occurrence of events around the fifth grade watershed year. This 
is the time when peer group formation differentiates members 
and lames in life on the street outside 'of school, and also 
differentiates their reading achievement within. More generally, 
fifth grade is the point at which, across the country, poor 
children's reading scores decline relative to their richer peers: 

A new stale report [by the state legislative analyst's office] says the 
achievement gap between richer and poorer children in the California 
schools appears to be widening. . [this decline] seems to be part of a 
national trend, with the decline beginning around the fifth grade and 
increasing later. (Palo Alto Times, 1 1/3/78) 

While these reading achievement data are usually interpreted in 
terms of the changing character of reading texts and tasks in the. 
intermediate grades, it seems probable that, for some students, 
value conflicts accentuate the problem. 

The connection between this research and the focus of this 
review comes in Labov's (in press) discussion of possible 
remedies. >^ 

The techniques of learning and studying imposed by our schools are 
avowedly individualistic and competitive. Each student is expected to 
learn by himself, and as I noted at the outset, interaction in the 
classroom is fundamentally confined to dealing directly with the 
teacher.... 

The skills that are highly developed in vernacular culture depend upon a 
different strategy. Sports, formal and informal, depend upon close 

cooperation of groups. The same holds for music Individuals 

practice by themselves, but the major steps in learning are done in 
tempo with the group 

If wc continue to repress vernacular culture, and try to extract one or 
two individuals from their cultural context, we will continue the pattern 
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of massive educational failure that we now observe in the schools. The 
other route is to understand the interests and concerns of the youth who 
come to school and use that understanding in a positive way (Labov, in 
press) 

This "positive way" will have to make possible less 
individualistic ways of learning to read, so that the power of 
group interactions can be used directly as contexts for learning, 
(See Steinberg & Cazden, in press, for one report of peer teaching 
in an intermediate grade,) Unfortunately, in discussions of 
teaching, the term "classroom interaction" has become limited to 
interactions in which the teacher is involved. We need to expand 
its meaning back to include all interactions in which learning 
takes place— not only with the teacher, but among students as 
well. 
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Social-Psychologicai Perceptions and 
Reading Comprehension 

Herbert J. Walberg 
Victoria Chou Hare 
Cynthia A. Pulliam 
University of Illinois at Chicago Circle 

Ancient psychologists and philosophers recognized the need for a 
balanced understanding of human beings in terms of the head, 
the heart, and the hand, or in modern psychological termi- 
nology—cognition, affect, and volition; but it has often been said 
that psychology lost its soul a long time ago, destroyed its mind at 
the turn of the present century, and is now in trouble with 
•behavior. Moreover, although psychologists have made valiant 
efforts to apply laboratory findings to natural settings of human 
action, the results have often been discouraging; for example, the 
so-called behavioral and cognitive revolutions in psychology are 
not notable for raising achievement test scores in ordinary 
classrooms in recent decades. What has been lacking in much 
psychological theory and research on school learning is a 
consideration of the student perceptions of the social-psycho- 
logical environment'of their classes, and the direct and indirect 
linkages of these perceptions to student outcome measures- 
such as standardized test performance, interest in the subject, and 
self-concept as a learner— that are of interest to public policy 
makers, educational practitioners, parents, and students. 

About a decade ago, however, research on student 
perceptions of the social-psychological aspects of their classes 
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began in high school physics in the United States and then 
expanded to other subject matters, grade levels, and countries. 
The research shows that student perceptions can be validly 
measured and can serve as an index for classes or individual 
students of the extent of cognitive, affective, and behavioral 
learning that goes on during the school year or shorter periods of 
time. The purposes of this chapter are: l/to explain how these 
student perceptions can be conveniently measured in ordmary 
classrooms, 2) to review the research on the prediction of learning 
outcomes from student perception^, 3) to relate this research to 
models of reading comprehension; and 4) to discuss the 
implications of the first three sections for research on reading as 
well as the practice of teaching reading. 

Research on student perceptions comes under the 
headings of morale.. climate,/br social environment in recent 
educational research. These terms emphasize that it is the general 
perception of the composite Qualities of classes rather than their 
so-called objective characteristics such as group size and counts 
of collective behavior that are bein^^ examined. There are a great 
number of theore'tical and methodological issues that cannot be 
fully discussed here because of space limitations, but the 
interested reader may find the following publications of help in 
pursuing the topics briefly considered here: Insel and Moos 
(1974) on social-psychological research of perceptions of a 
variety of human settings; Campbell (1970) and Majoribanks 
(1974) for valuable but neglected collections of substantive 
research on learning environments carried out in Australia, 
Canada, England, and the United States; Kahn and Weiss ( 1 97-3), 
Randhawa and Fu (1973), Shulnrian and Tamir (1973), and 
Walberg (1974, 1976) for substantive .and methodological 
reviews: and Walberg ( 1974, 1979) for sourcebooks on a variety 
of learning environment instruments, studies, and evaluations by 
a number of international groups. 

Measurement and Validity of Student Perceptions 

Much of the research on student perceptions of their 
classes employed the Learning Environment Inventory (l^i) or 
the My Class Inventory (mci). Table 1 shows sample items for the 
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Cohesivencss 
Satisfaction 
•Friction 
•Apathy 
" •Cliqueness 

Sms yariahk 
Democracy 
Competitiveness 

•Diversity 
Favoritism 

M Vambk 
Coal Direction 
Material Environment 
Formality 



Sample Items, Reliabilities, and Consistencies of 1$ Variables 



Sample Item 

All students know each other very well. 
The students enjoy their class work. 
Certain students responsible forpetty quarrels. 
Students don't care what the class does. 
Some students refuse to mix with the rest of the 
class. 

Students have about eqiial influence on the class. 
There is much competition in the class. 
Intercst^^vary greatly within the group. 
Certain students are favored more than the rest. 

Each student knows the goals of the course. 
The room is bright and comfortable. 
The class has rules to guide lis activities. ^ 
Students find the work hard to do. 
The course material is covered quickly. 
The class is disorganized and inefficient. 



Class 
Reliability 



.19 
,80 
.77 

.74 

.6! 
.56 
.36 
.65 

.73 
.79 
.87 

:8i 

.76 
.87 



Percent Positive 
Corriialions with ' 
Learning 



14 



81 

67 

30' 



73 
86 
65 
87 
54 
6 



Speed 

-Disorganization 

iVo/f. The learning criteria include cognitive, affective, and behavioral measures including higher menial processes and self-conccpl, 
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LEI intended for students in junior and senior high schools. The 
LEI consists of 105 items, 7 items per scale on each of 15 scales. 
Students rate their classes on four points— strongly agree, agree, 
disagree, and strongly disagree— on each of the items. For 
example, on the Cohesiveness scale, a student is asked to agree or 
disagree with the item' "All students know each other very welL" 
The seven items on each scale are summed for analysis. 

The" LEI can be administered in approximately 25 minutes 
and it produces reliable es'timates 6f the morale, climate, or 
learning environment of the class. In the interest of ^saving 
student time, some investigators have omitted some of the scales 
or reduced the number of items per scale from 7 to 3 without too 
much sacrifice in reliability. The scales have been translated into 
several modern languages and used in a rtumber of countries in 
Africa, the Americas, Asja, and Europe. 

The My Class Inventory is an adaptation of the lei for 
elementary school research. It consists of 45 items, 9 items per 
scale on each of 5 scales. The vocabulary level of the items has^ 
been reduced; but it i^ sometimes found to be necessary to Yead 
and explain the jtems to some students and explain items to 
classes-of poor readers. Working from these two standardized 
instruments, a number of investigators have modified the items 
and scales to suit them to their particular research interest and 
setting. 

Prediction of Learning Outconies 

From social-psychological research, Walberg (1969) 
formulated 36 hypotheses concerning the direction of relations 
between selected lei scales and learning criteria, namely that 
Cohesiveness, Satisfaction, Task Difficulty, Goal Direction, 
Democracy, Diversity, and Material Environment would be 
positively correlated, and that Friction, Cliqueness, Apathy, 
Favoritism, and Disorganization would be negatively correlated 
with the extent of gains in cognitive, affective, and behavioral 
learning. A tabulation of the results (Haertel; Walberg, & 
Haertei; 1979) across 10 large data sets shows that 3 1 out of 36 or 
86 percent of the signs support the formulations (Tabic l);.the 
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probability of these results occurring by chance is less than .00 L 
Three oY the five disconfirmations concern the Diversity scale 
which shows/generally negative relations with outcomes in all 
three learning domains^, ratKer than the hypothesized positive 
direction. The results which are tabulated in the last column of 
Table 1 show the general consistency * of the. findings in the 
research done in the past decade. 

To estimate the sizes of the correlations of student 
perceptions with learning outcomes and adjusted learning gains, 
734 correlations from a comprehensive collection of IZstudies of 
10 data sets, on 823 classes containing 17,805 students in eight 
-subject areas and four nations, were -synthesized (Haertcl, 
Walberg, & Haertcl, 1979). Table 2 shows a summary of the 
findings for two specific areas— high school history and 
elementary school reading. Most of the investigations upon 

{ . * 

' Table 2 

Correlations of Class-Mean Student Perceptions with Gain.s in'Learning 



High School History Elementary SchoorReading 





Estimated 


Observed 


Estimated 


Observed 


\\ijea 

Cohe.si^ene.ss 




.81 


.17 


.00 


Satisfaction 


.45 


,63 ' 


.38 


.11 


-Friction 


-.80 


-,9() 


-.52 


■ -,37 


-Apathy 


-.33 


-.86 • 






-Cliquene.ss 


-.27 


-,74 






Sfatus 










Democracy 


.36 


,6; 






Competitix cness 


' ,19 


-.15 


-.23 


-.51 


-Diversity 


.19 


-.15 






-Favoritism 


-.31 . 


-.59 






Ta.sk . . 
Goal I?irection 


.39 


,66 






Material Environriient ,55 


.80 






Formality 


.23 


-.12 , 






Difficulty 


.28- 


V ,26 


-.14 


-,27 


Speed 


.19 


-,15 






-Disorganization 


-.25 ■■ 


-,71 







.Vf>/c. The estiniiiies arc based on a quantitative synthesis (if more than 700 airrclations 
from 10 independent data sets (Haeriel. Waiberg. & llaeriel. 1^J79). 
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which the estimated correlations are based obtained pretest 
measures of achievement and interest in the subject during the 
first month of'the fall term, obtained similar posttest measures at 
the last month of the spring term, and measured stlident 
perceptions of the social-psychological environment of their 
classes during the middle of the school year. 

The correlations of student perceptions with adjusted 
gains in learning vary from low to high across variables and 
investigations; Table 2 shows examples from two studies. In the 
area of affective relations in the class; Friction in the group, that 
is, tensions and hostility among class members, is negatively 
correlated with learning outcomes; and Satisfaction and 
Cohesiveness are positively correlated with learning outcomes. 
In the general area of status within the classroom group, the 
extent to which the class is seen to be democratic and not 
favoring particular individuals in the group is moderately 
correlated with learning efficiency. In the area of task 
orientation, the perceptions of Goal Direction and the presence 
of materials and supplies in an adequate physical facility for 
carrying out the activities of the class are positively correlated 
with learning outcomes; and Disorganization is negatively 
correlated with learning gains. 

When the scales are weighted and added to form, a 
composite index of the social-psychological environment con- 
ducive to learning rather than using one scale at a time, the 
predictive validity increases substantially. Seven studies added 
sets of scales to regression equations that also included ability or 
pretest measures or both as controls. The average incremental 
variance accounted for in learning outcomes beyond that 
accounted for by ability and pretest on 19 learning outcomes is 20 
percent, with a range from 1 to 54 percent; these incremental 
variances are substantially greater than that accounted for by IQ 
when pretests are taken into account. Thus, regressions 
containing the control and perceptual variables account for a 
large amount and, in some cases, nearly all of the reliable 
variance in learning outcomes. These analyses show that the 
measures of the social-psychological environment during the 
course of learning afford a very accurate prediction of how much 
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will be learned durinj; the school year and serve as a useful index 
of the average amount that the class is learning at any given time. 

Perception and Reading 
Ri< Although learning outcomes and gains are positively 

associated in general with Cohesiveness, Satisfaction, Task 
Difficulty, Formality, Goal Direction, Democracy, and. Material 
Environment and negatively associated with Friction, Clique- 
ness. Apathy, and Disorganization, 8 of the 10 data sets have 
been gathered in various countries from secondary school classes 
» in the sciences, humanities, and social studies. Only two studies 
have been carried out in the elementary grades, and only one of 
these has concerned reading. 

Talmage and Walberg (1978) showed that student 
perceptions of their reading classes yielded the most accurate 
predictions of end-of-the-year standardized reading test scores 
with .adjustments for similar pretests among four sets of 

Table 3 . 

Corrdations of Learning Gains with Four Sets of Variables 

ReacJin}^ scries 
Ginn 360 

Holt. Rinehart. and Winston 
Lippincott 

Scott, Foresman Sy.stems 

Teacher c haracteristics 
Years in teaching 
Workshops attended 
. Reading courses 
Awareness of design 

Instruction 

Decisionmaking 
, Student behavior 

Materials utilization 

hlnvironmcftt 
Cohesiveness 
Competitiveness 
Difficulty 
Friction 
Satisfaction 

Sole. One. two, and three iisicrisks mark ci>rrelalions rcspcdivcly jiigriLficant al Ihc .10. 
and .01 levels. 



.00 
.0« 
22* 
-.23* 



.22* 
-.01 
-.1.1 
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prediction variables. Table 3 shows that, for a sample of 60 
classes, grades one through six, the reading series used, teacher 
characteristics, and the nature of instruction were very weakly 
correlated with adjusted learning gains. Competitiveness, 
Difficulty, and Friction were moderately correlated in a negative 
direction with the extent of gains on standardized reading tests. 

A selection of items on the My Class Inventory for these 
scales illustrates the characteristics of classes in which poor 
reading gains were made. On the Friction scale, several items are: 
"Some of the children in our class are mean." "Children are 
always fighting with each other." and "Some pupils don't like 
other pupils." On the Competitiveness scale are the following 
items: "The same people always do the best work in our class " 
"Children often race to see who can finish first." and "Most 
children want their work to be better than their friends' work." 
On the Difficuity scale are the following items: "In our class the 
work is hard to do." "Only the sm.art people can do the work in ' 
our class." and "Children often find their work hard.'' 

These results are important for several reasons. First, they 
show that the general predictive validity of student perceptions 
found in secondary classes is initially confirmed in elementary 
school. Second, they call attention to the need for further 
research on social-psychological perceptions in elementary 
school' reading. Although Friction is negatively associated with 
learning outcomes in elementary reading and thus confirms the 
results from secondary classes, the other two scales are reversed 
in their signs; that is, although Competitiveness and Difficulty 
are weakly and positively related to learning gains in secondary 
classes, they are negatively correlated in the sample of elem^^ 
school'students with reading achievement as the outcome. The^c 
reversals in sign may be attributable^to the idiosyncracies of a 
single study, or they may indicate that substantially different 
social-psychological environments are conducive to learning in 
elementary school. For example, it may be that difficult, highly 
competitive norms are conducive to learning in high school 
students but deter reading gains in elementary school students. 

As mentioned earlier, the My Class Inventory contains 5 
of the 1 5 scales from the Learning Environment Inventory: it is a 
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matter of speculation how the other 10 scales will relate to • 
learning outcomes'in elementary schools. From the research 
synthesis on learning environments and from prior social- 
psychological research in laboratory and natural settings of 

\ learning, it seems reasonable to infer.that subsequent research in 
\ all levels of education will continue to show that Cohesiveness, 
Satisfaction, Goal Direction, Democracy, and the Material 
Environment enhance learning, and that Friction, Cliqucness, 
Ai^^ithy. and Disorganization deter learning. However, addi- 
tion^il studies in elementary schools and in the field of reading, in 

" partit^ular, arc urgently needed to confirm ordisconfirm these 

inferences. , . • 

\ 

\ 

Perceptions as Goals 

Because learning environinent scales provide a predictive- 
ly valid index of the amount of learning gains made during the 
academic year as indexed by standard tests, tht scales can 
occasionallv serve as convenient substitutes for the standardized 
achievement, and comprehension tests themselves. Since the 
<;cales are closely linked to affective and behavioral outcomes as 
well, they are useful indices in a variety of settings a nd subject 
matters. At the secondary level, the scales are fairly context fre6; 
that is, social-psychological perceptions conducive to learning in 
one-set of circumstances are also conducive to learning in other 
sets of Circumstances. 

Bv^cause different curricula, teachers, and forms df 
instructiri; emphasize different sets of gpais and outcomes of 
learning, standardized tests arc not always an accurate or fail > 
assessment across ler.rning settings; arid, in these circumstances, 
the i.Ei scales form a useful index of curriculum, instructional, 
and other kinds of educational e ffects. Since the earlier research 
demonstrated the predictive validity of the instruments, recent 
V, research has employed the scales to provide stktistically-reliable' 
sensitivity to educational treatments such as curriculum, teacher 
training, and instructional innovations, as well as to project 
efforts to increase student ceamwo; K, cross-sex and cross-ethnic 
group cooperation, and similar properties. Other contemporary 
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work reveals that student perceptions reflect and mediate teacher 
and student characteristics, and that they provide diagnostically- 
valuable profiles for class and individual morale or climate that 
can be used in needs assessments and impact evaluations. More 
than 6t)0 investigators from many natiort^have requested the i.Ei 
and MCI scales for their work in evaluation and research. 

In summary, scales measuring student perceptions of the 

• 'social-psychological environment of learning provide useful 
independent, mediating, and dependent variables in educational 
investigations in natural settings. They complement and 
supplement current behavioral and cognitive variables and reveal 
social realities in classes that are neglected in laboratory-derived 
research. Even if the purpose of a research effort does not 
concern student perceptions themselves, the perceptions can be 

. used as control variables so that subtle cognitive and behavioral 
effects may be detected in the complexities of natural settings. 
Lastly, it is the student that is the client of the educational system 
and the person that the system is attempting to infiuence. Surely 
these perceptions should be one basis of consideration and 
accountability in the evaluation of instruction, curriculum, and 
other aspects of the educational program.* 

' This section illustrates the importance of student 
perceptions of their classes. The next section of this chapter 
outlines three models .of reading which incorporate the 
dimension of comprehension. The last section links research on 
student perceptions to components of these models to hypothe- 
size how comprehension is affected by the social-psychological 
environment in natural settings of learning. 

Models of Reading 

Many reading teachers and researchers would subscribe 
to the following description of reading comprehension by Gibson 
and Levin (1975): "We comprehend the meaning of a word, the 
meaning of a sentence, or the meaning of a passage of discourse 
when we apprehend the intention of the Writer and succeed in 
relating his message to the larger context of^our own system of 
knowledge; (p. 400). Like other descriptions of reading 
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comprehension, this is very general and lends itself to different 
interpretations of how the process of reading comprehension 
actually operates. The following discussion briefly describes 
three much-quoted, yet very different, reading models and points 
out their similarities and differences in postulated structures and 
processes. Common and distinctive features of the three models 
provide leverage points for hypothesized influences of student 
perceptions of their learning environment on reading compre- 
hension. 

Current literature illustrates at least three types of reading 
models: bottom-up, top-down, and interactive. These designa- 
tions refer to different assumptions about the nature of the 
reading process. If reading is initiated by visual stimuli or print, 
requiring little input from the reader, processing is referred to as 
bottom-up; Cough's (1972) model of the reading process 
exemplifies this type of model. If reading is initiated by reader- 
generated hypotheses, requiring fewer visual cues, processing is 
referred to as top-down; Goodman's (1970) model of the 
proficient reader adheres fairly well to the characterization. If 
reading requires bottom-up as well as top-down processing, the 
model is referred to as interactive; Rumelhart's (1975) work 
represents this type of model. 

Despite their different characterizations of the reading 
process, certain structures and processes are common to the three 
models. All the models incorporate ""visual activities as initial 
processes, although only Goodman hypothesizes in his 'model 
that the reader must bring expectations about the nature of the 
reading task from both long- and medium-term memory 
structures to this process. The three authors differ in their notions 
about the kind of information which is first held in short-term 
memory. Gough and Rumelhart view this information as finely 
differentiated as series of lines, curves, and angles, while 
Goodman mentions key letters, e.g., beginning consonants, as at 
least one type of initially-stored graphic input. 

* In Rumelhart's model, a feature-extraction process sorts 
critical from noncritical letter features and deposits critical 
features into a pattern synthesizer. This pattern synthesizer 
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appears to be a form of short-term memory. For Cough, the 
parallel process converts letter features directly into letters which 
are scored in the short-term character registc^n According to 
Cough, an abstract phonemic counterpart is then attached to the 
letter representation and stored on a phonemic tape. The 
phonemic representations dre subsequently acted on by a word- 
recognition process and stored in what Cough calls primary or 
working memory — apparently, another form of short-term 
memory. 

Clustering of graphemic information proceeds linearly up 
to the word level in Cough's model. Goodman, on the other 
hand, suggests that the original perceptual image of graphemic 
cues is formed and stored in short-term memory until a search in 
long-term memory yields related phonological, syntactic, and 
semantic cues. These latter cues are then transferred to short- 
term memory for comparison with the perceptual image. 

Rumelhart's model draws together many levels of 
information in a multiple-process, pattern-synthesis stage. 
Besides critical features of letters previously mentioned, other 
levels of information include orthographic (lettercluster), lexical 
(word), syntactic, and semantic cues, all of which are processed to 
produce *'the most probable interpretation" or meaning. 
Processing at any level of information can effect processing at 
any other level. Compared to Cough's and Coodman's models, 
the Rumelhart model is most explicit about disallowing use of 
extraneous information. For example, in the sentence "Birds of a 
feather flock together," a reader who had encountered the slogan 
in his or her previous experience might be able to .achieve "the 
most probable interpretation" by utilizing syntactic and semantic 
levels of information after relying on lower levels of information 
for the first three words. Such a leap to higher levels of 
information would not be possible in Cough's model; and while 
Coodman would believe such a leap possible, his possible 
sequences of processes are not so well-defined as in the 
Rumelhart model. 

Word representations from Cough's primary memory are 
acted on by a sentence-comprehension process that applies 
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syntactic or semantic rules to these words. Newly-understood 
sentences arc then sent to TPWSGWTAU (The Place Where 
Sentences Go When They Are Understood), or long-term 
memory— What Gough calls secondary memory. Unlike Rumel- 
hart's view, then, the comprehension process for Gough is 
isolated and \c(i to the very end of the sequence of reading 
processes. 

Goodman s conversion to meaning, or comprehension, 
the heart of his model, results in elaborate hypothesis-testing 
processes. Following comparison of the perceptual image in 
short-term memory with identified cues from long-term memory, 
a check for a match between the two is made. If a match is made, 
then information in .some unspecified form is sent to medium- 
term memory, where another test for appropriateness against 
previously-developed syntactic and semantic choices occurs. 
Presumably, these choices are available from prior context. If the 
choices match, comprehension follows. If choices do not match, 
other options are available; for example, a reader may regress to 
the beginning and start the entire sequence again. 

Within these models, it is possible to identify common 
variables that a reader's perception of the learning environment 
may influence. These include the size of the information units 
that are processed, the number of units processed per unit of 
'"^ime, and the storage capacity of short-term memory, Simon 
(1969r^xBmines these variables and defines a "chunk" as a 
familiar unit of information. He not only shows that from two to 
seven chunks may be held in short-termjTiemory, but also that it 
takes five seconds to move a chunk from sfiort-tcrrn to 
long-term memory. This time can be lowered, moreovcrv^by 
increasing the meaningfulness of the chunks. It seems reasonable 
to hypothesi/.e that a less than optimal environment in the form 
of inappropriately high levels of competitiveness, friction, .and 
difficulty will cause errors in the input of chunks to short-term 
memory, reduction in the number of short-term memory and 
long-term memory items, and slowed processing times. For 
example, readers' perception of friction in their learning 
environment may so distract them as to adversely affect their 
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visual-system activities at the very first stage of reading, Because 
four fixations take just one second,. according to Gough, the 
readers' processing of the lines, curves, and angles may be either 
incomplete or incorrect. Readers may not garner all the 
information available for short-term .memory, or may not garner 
sufficient information to fill short-term memory, but incorrectly 
perceive some of the graphemic input. The fact that readers have 
not correctly or completely processed the lines, curves, and 
angles is not as critical in top-down models because, in these 
models, readers have access to other sources of information, such 
as world knowledge. Furthermore, the possiblity of regression is 
not only allowed, but encouraged, in top-down models, thereby 
providing readers with the opportunity to gather up missed cues. 

A reader who is distracted by ineffective class morale in 
early stages of Cough's version of the reading process will be 
more hampered in his progress toward comprehension than one, 
who encounters interference in later stages. Because no 
backtracking occurs in Cough's model, a reader who is closer to 
the final stage of processing before things go wrong will be more 
likely to derive some approximation of the author's intended 
meaning. 

In top-down models, as^mentioned earlier, even if there 
are errors in initial processing of chunks of information, they are 
not as critical as they might be in a bottom-up model for a 
number of reasons. The regressive and interactive features of 
Coodman's and Rumelhart's models, respectively, as well as the 
possibility of relying on more sources of information than initial 
visual ones alone, should lead to fewer errors based on incorrect 
or incomplete perception of visuaLcues. Because lop-down 
models do depend on other sources of information, these other 
sources are all somewhat susceptible to the influences of learner 
perceptions of the social environment. 

In either Coodman's or Rumelhart's models, comprehen- 
sion is virtually impossible without input from the reader. For 
example, hypothesis-testing is a crucial part of Goodman's model 
that requires readers to make guesses aboift^forthcoming content. 
Students who are preoccupied by social-psychological problems 
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in their classrooms may be less likely to play the "psycho- 
linguistic guessing game/' as Goodman terms this risk-taking 
application of his model. Likewise, studencs who are troubled by 
perceptions of competition, frictior, or difficulty in their 
classroom environments may be unable to synthesize at the most 
crucial stage of Rumelhart's model 

In conjunction with the above, in these models, readers 
must bring prior knowledge and expectations of the world 
around them to their understanding of" the printed page. If 
readers are unduly concerned by psychologically threatening 
aspects of their environment, they may be less likely to utili/.e this 
personal knowledge. Because they may fail to make full use of 
available "scripts" about things they do kfiQ,^. due to 
environmental distractions, comprehension may be seriously 
hampered. 

in Goodman's and Rumelhart s models>,a reader would, 
under such conditions, be less vulnerable at any one stage of 
processing. That is, meaning cues would be accessible from a 
variety of sources, and would not be blocked by inadequate 
decoding alon^:. The time necessary for a given amount of 
comprehension, however, might be greatly extended by percep- 
tion of a frustrating environment. 

Implications for Research and for the Teach in}^ of 
Comprehension 

The three model-builders selected ; comparison im- 
plicitly incorporate variables of informat..:s: .nit size, number 
processed per unit of time, and short-teim men ory-storage 
capacity, although each does so in different ways. Within each 
model, these variables may be found to be related to the influence 
of the social-psychcjlogical variables in the learning environment 
perceived by students. .As mentioned earlier, student perceptions 
which inde.x the environment accurately predict gains on 
learning-outcome measure3, for example, reading comprehen- 
sion scores. This relationship between student perceptions and 
reading comprehension needs to be more closely explored in 
terms of actual everxts in the classroom. 
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Based upon Talmage and Walberg's (1978) elementary 
school findings thkt Competitiveness, Difficulty, and Friction 
are negatively correlated with gains on a siandardized readmg 
test, the following questions may be raised: Where in a reading 
class situation might student-s perceive competitiveness, diffi- 
culty and friction, and how might these perceptions be related to 
reading comprehension as hypothesized by the models? 

Some reading situations.may prove more cortducive to the 
perce^tipns of competitiveness and fricticyi than others. For 
exTml^le, with regard to grouping, students involved in cross- 
class or interclass grouping for reading would find themselves 
with others of similar reading ability, as more teachers would be 
available to accommodatd the range of reading levels commonly 
found at any one grade. Such a situation might lend itself to a less 
competitive atmosphere than the more commonly-found intra- 
class grouping situation might. Experiments along this line 
would be interesting to condu.:. 

Children may also find themselves in more or less 
competitive environments, depending upon th«. reading approach 
used Children in an individualized reading program, where 
children v0rk at their own pace on materials specifically tailored 
for their needs-at least, in the ideal case-may be unaware of 
other children's reading successes or failures. Those children in a 
basal reading program, where children are grouped according to 
their reading ability, may be acutely aware of others' reading 
successes or failures. Thus, a more likely breeding ground for 
competition is established. ^ 

Other subtle aspects of the classroom environment— what 
Jackson (1968) terms "the hidden curriculum"-may serve to 
promote student perceptions of friction. Teachers foster certain 
ideals by displaying in the classroom only those language- 
experience stories or book reports that conform to their own 
expectations, while other work not so favoi-ed is returned. A 
teacher may inadvertently create friction in a classroom by this 
kind of favoritism. Feelings of friction among peers might also be 
generated when poor readers are continually corrected in their 
oral reading by good readers. And, as demonstrated earlier with 
regard to competition, the reading program itself may engender 
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an environment of frietion. For example, reading programs ' 
which emphasize only phonics instruetion will reward those 
students who can sound out words. Those students who must rely 
or who best rely on other strategies when reading, such as making 
use ofcontext or prior knowledge, will, in sueh a program, garner 
less recognition and praise from the teacher Under these 
instructional conditions,* students who would in another setting 
normally be called good readers are less favored than students 
who are comfortable with the phonics strategy. Student 
perceptions of competitiveness and friction in certain classrooms 
may, therefore, cause feelings of frustration and impaired 
reading comprehension. 

Reading situations may also foster student perceptions of 
difficulty. Recent literature provides numerous examples of mis- 
matches which may lead to perceptions of difficulty in the 
classroom. One such mismatch exists when teachers fail to match 
instruction to student needs. Carroll(l97|)documented thewide 
variability of reading grade levels from fourth graders to college 
sophomores and noted that the test performance ol'the top one 
percent of the fourth graders was identical to the bottom fifth 
percentile of the college sophomores. Teachers who do not adapt 
to such variability will be likely to make unreasonable demands 
of their low readers, while not demanding enough of their 
accelerated readers. In this way, reading instruction may be 
perceived by students either as to<^ difficult or not difficult 
enough. The statistical evidence gathered thus far suggests that 
elementary reading classes may be too difficult, on the average, 
but that high school classes in most subjects are not challenging 
enough (Haertcl. Walberg. & Haertel, 1979, Tables 2 and 3). 

I n the same vein, research findings on teacher questioning 
practices are pertinent. Anderson. Spiro, and Montague (1977) 
favor the use of questioning to extend comprehension; however, 
Guszak (1968) and Bartolome (1969) both note the prevalence of 
teachers' questioning at the lower, literal levels. Such practices 
may stifle the critical thinking abilities of accelerated students. 

Goldberg (1973) observed teachers who were stressing 
comprehension in their reading programs, while students were 
yet decoding. Teachers were again making inappropriate 
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demands of their stiidents--at least, Gough's view of reading is 
consistent with this assumption. Guthrie and Tyler's evidence 
(1978) that indicates that poor readers at all grade levels— but 
especially at primary grade levels—are hampered by inadequate 
decoding or word recognition skills also supports Gough here. 
Insufficient decoding skills are not as great an obstaele to a 
reader's understanding for the'oiher two model-builderii miainly 
because, for them, routes to meaning are numerous Poor 
readers, even in the primary grades, would be expected to ■ ^ow as 
many comprehension-related as decoding-related deficits. 

Not only mismatches of' instruction and student needs 
may occur. Another kind of mismatch leading to perception of ' 
difficulty may occur between students' language and experience 
" and textual language and content. Ruddell (1965), among others, 
showed that comprehension was impeded when syntactic 
patterns of the text did not match those of children. Unfamiliar 
language may be perceived to be more confusing and difficult 
than familiar language patterns. Likewise, reading comprehen- 
sion has been shown to be enhanced when students possess prior 
knowledge about the content of the materials read (Gordpn, 
Hansen, 8l Pearson, 1978). Thus, student perceptions of 
difficulty caused by mismatching of instruction or materials to 
readers' needs can easily be linked to reading comprehension. 

Other scales not yet studied with regard to the elementary 
classroom may contribute to perspectives on reading compre- 
hension. For example,'student perception of Goal Directio^n may 
turn out to correlate positively with reading outcome measures at 
the elcmentarv as well as the secondary level. A study by the 
American Ins'titute for Research (Bowers, 1974) found that 
outstanding reading programs appeared to share many com- 
ponents associated with goal direction: 

aciulemic nbieciivcs that were clearly stated, that were broken inr- 
smaller units, and that izavc evidence of caictul planning.. h?^:i.,v 
structured teaching directly relevant to ihe objectives providing 'lo'l'. 
feedback and diagnostic information, (p. 56) 

Future research in elementary school reading classes testing the 
negative inlluence of Apathy, Friction, Cliqucncss, Disorgani- 
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zation, and Fcivoritism and the positive influence of. Democracy, 
Material Environment, Cohesiveness, and Satisfaction on 
learning should be interesting and valuable in extending our 
general knowledge of how social-psychological perceptions 
influence learning and our specificunderstanding of the teaching; 
and learning of reading comprehension;. 
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Our airti in 'this paper is to summarize the scientific knowledge 
base which underlies reading comprehension instruction in the 
middle elementary years. Of particular interest are specific 
instructional practices rather "than comprehensive reading 
curricula as represented by basal reading programs. Little can be 
said about eitHe^ the relative or absolute effects of various 
reading curricula since virtually no research exists on the subject 
(Jenkins & ,Pany, in press). That there is so little information 
about the/effects of reading curricula is remarkable given the 
overwhelming reliance on these programs by schools. -Even if 
data w,ere available on the effectiveness of various reading 
currici^ia it woujd be difficult to determine which aspects of the 
programs were functional and which were frivolous, since the 
programs are comprehensive and each . contains a broad 
coU'ectipn of instructional practices.Relatively more research has 
be4n conducted on a host of more circumscribed instructional 
variables. ' ^ ' . 

I The decision to focus on studies conducted with children 
in the middle elementary grades was based on several 
considerations. Below grade three, reading instruction typically 
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emphasizes word decoding rather than comprehension, a 
practice that is not without its critics(Smith, 1973). Beyond grade 
eight, classroom instruction becomes increasingly cc itent 
oriented, with less emphasis given to reading process. It is in the 
middle elementary grades that schools explicitly admit to 
teaching reading comprehension (Mason & Boggs, Note 1). By 
confining our review to studies conducted with children as 
opposed to those with more mature readers, we do not mean to 
imply that studies with the latter group are without relevance, 
only that we are primarily interested in instructional factors 
which affect the development of the ability to comprehend 
written discourse. 

For .the purpose of this chapter, we liberally define 
reading comprehension and reading comprehension instruction. 
We accept as evidence of comprehension such varied perform- 
ances as answering passage-dependent questions, retelling facts 
or ideas stated in a reading selection, completing cloze tests, and 
orally by^reading passages for miscue analysis. Likewise, we take 
a broad view of reading comprehension instruction to include 
deliberate attempts by a researcher to modify children's 
comprehension or memory for prose. In our sense of the term 
instructional practices for reading comprehension are those 
factors external io the text which can be manipulated to influence 
comprehension and memory for it. Thus, such variables as 
advance organizers*, pictures, or paragraph heads can be thought 
of as instructional variables. Teaching children skills related to 
rapid decoding, vocabulary, listening, cloze performance, and 
imagery also qualify for inclusion since instruction focuses on 
factors external to the text. Not all variables that mfluence 
comprehension qualify as instructioiial variables, however. For 
example, story plot (Thorndyke^ 1977), text organization 
(Meyer, 1975), and syntactic structures (Chomsky, 1972) are 
factors which influence the comprehensibility of prose, but they, 
are not instructional variables in the sense in which we use this 
term. Passage characteristics such as those cited above definitely 
can affect a reader's acquisition of an-author^s message andean 
legitimately be considered instructional variables wTtHTespect-to 
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this imrndcd message. However, since they are characteristics of 
a paiticular passage they cannol be manipulated without 
changing the passage itself. We do not consider the modification 
cf a passage to make it more comprehensible to be an instance of 
teaching reading cofvprchcnsiun. 

With reading comprehension and its instruction defined 
in this way, there is considerable diversity in the studies which 
qualify as instructional research on reading comprehension. The 
diversity is a serious obstacle for anyone attempting to organize 
and comment on the present status of knowledge in the teaching 
of reading comprehension. A good theory of reading would 
certainly help to identify promising instructional interventions 
and to aid in the understanding and classification of variables 
affecting comprehension, but such a theory has yet to emerge. 
Without a theory and in search of a framework in which to 
organize a wide array of instructional variables, we noted that we 
could identify lines of research which appeared to focus on 
different aspects of reading comprehension. The variables chosen 
for study, to some extent, reflect what an investigator believed to 
be the source of reading comprehension failure. We were able to 
identify three general sources of comprehension failure which 
could serve as a basis for grouping instructional research on this 
problem. 

Causes of Reading Comprehension Failure 

One line of instructional research emanates from a "top 
down" view of the reading process. This view highlights the' 
individual's use of existing knowledge structures to interpret and 
organize prose. Individuals might fail to comprehend because 
they lack the appropriate abstract kno\yledge structures or 
schemata needed for making sense out of text (Anderson, 1977; 
Spiro, 1978). In some instances the individual may even possess 
appropriate schemata but for some reason the text does not 
activate these structures. In either case the individual is unable to 
make the necessary cognitive contributions required for 
producing the "click" of comprehension (Brown, 1968). Classic 
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illustrations of these two situations have been provided by 
Bransford and Johnson (1972), From the standpoint of 
instruction, a variable such as an advance organizer reflects a 
top-down orientation. Organizers are hypothesized to enhance 
comprehension through their influence on cognitive structure. 

I n contrast to this top-down orientation, other analyses of 
comprehension emphasize "bottom-up" or text-driven process- 
ing (Spiro, 1979). Concern is directed more toward the linguistic 
demands of the text than toward cognitive structure variables 
and background knowledge of the person reading the text. 
According t6 this "bottom-up" analysis, comprehension 
problems are likely to result from passage-related variables such 
as complex or unfamiliar vocabulary, sentence syntax, and text 
organization or from passages that require special thinking skills 
(for example, syllogistic reasoning). Accordingly, comprehen- 
sion instruction attempts to provide the individual with the 
essential skills needed to simplify the linguistic input of text. 
Examples of comprehension instruction that emanate from a 
bottom-up orientation include teaching children word meanings, 
to decode rapidly, to interpret complex sentences, and to apply 
reasoning strategies. 

Whereas these two views of reading comprehension- 
emphasize schema (top-down) and text-driven (bottom-up) 
processing, the third analysis focuses on attentional factors and 
levels of processing (Craik & Lockhart, 1972). Of interest are 
comprehension problems that result from inattentive, superficial 
reading or from ^lisdirected attention (e.g. concern with correct 
word calling rather than meaning getting). Here, the reader is 
assumed to possess adequate background knowledge along with 
satisfactory linguistic and reasoning skill, but may not have 
learned to maintain focus on and self-monitor comprehension, or 
to employ systematic memory strategies. Instructional research 
examines variables that curriculum writers rhay employ to 
maintain reader attention (e.g. adjunct questions), that readers 
may use to monitor their own memory or comprehension 
processes (e.g. imaging), and that teachers may use to shape 
attentional focus (e.g. corrective feedback that emphasizes 
meaning vs. sound symbol relations). 
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We have categorized instructional research on reading 
comprehension into these three classifications: background 
knowledge,' or schemata, linguistic/ reasoning skills, and atten- 
tional/ processing factors. Admittedly, not all the instructional 
, practices we examined fit neatly into these classifications and one 
can take exception to oyr classification of particular, variables. 
For that matter, we take exception to our own classification at 
times, if that provides any comfort. Nevertheless, these categories 
should provide a framework for grouping some very different 
instructional variables, and should help prompt speculations as 
to what a researcher believes is the source of failure to 
comprehend. 

Instructional Variables Addressing 
Schema- Related Problems 

Students may fail to comprehend a passage betausethey 
lack the I'elevant background knowledge or schemata needed for 
constructing meaning from the textual recipe, Reading instruc- 
tion that focuses on improving language abilities or on increasing 
semantjc processing will not resolve the comprehension problem 
since the problem results from a reader's inability to make the 
needed cognitive contributions to a text rather than from 
language or attentional inadequacies. 

Does background knowledge really affect comprehension 
in normal reading situations? It is noteworthy that the most 
convincing demonstrations of the importance of background 
knowledge have occurred with contrived, and often ambiguous, 
reading passages (Bransford & Johnson, 1972), Hov^ever, a recent 
study by Pearson, Hansen, and Gordon (1979) suggests that 
differences in background knowledge may indeed abcount for a 
significant portion of the variance in comprehension perfor- 
'mances in normal reading situations, in this case with passages 
taken from a second grade reader, Pearson et al, selecte^d children 
who were comparable in intelligence and in "reading comprehen- 
sion ability," but who varied in background knowle(;ige for a 
particular topic. These students' comprehension of a passage on 
that topic appeased to be a function of their prior knowledge of 
the topic. Notabiy the effects of. background knowledge were 
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more pronounced on "scriptually implicit" questions (those that 
require an integration of textual and background data) than for 
"textually explicit" questions (those that require responses 
directly from the text). Pearson et al. interpret their findings as 
supportive of the notion that comprehension involves integrating 
new information into pre-established schemata and that such 
integration is seriously impaired for students with poorly, 
developed schemata. Their conclusion would be strengthened 
considerably if the experimenters had employed a "test only" 
control. It could be argued that comprehension differences 
between high and low schema groups would have existed even if 
students had not read the experimental passage, but had merely 
been tested. 

Several studies that attempted to facilitate reading 
comprehension have employed variables intended to establish, 
modify, or activate background knowledge or schemata. We 
include research on advance organizers, pictures, and certain 
textual adjuncts such as titles and paragraph headings. We 
interpret advance organizers and pictures primarily as attempts 
to provide the reader with background knowledge or relevant 
schemata. Titles and paragraph heads (and sometimes advance 
organizers and pictures) may be thought of as mechanisms for^ 
helping the reader identify an already available schematic 
framework with which to integrate textual information. 
Research on these variables is examined in'this section. 

■ Advance Organizers 

Ausubel (1963) has formulated one instructional proce- 
dure for enhancing reading comprehension that gives special 
consideration to the learners background knowledge. He concep- 
tualized ?.he advance organizer as a '\ . , bridge. . . between what 
the learner already Icnows and what he needs to know in order to 
learn new subject matter effectively" (Ausubel, 1978, p. 253). An 
svance organizer consists of introductory material which is 
. elated to already existing ideas in the learner s head and which is 
written at a higher level of abstraction, generality, and 
inclusiveness than the learning passage itself. 
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While numerous researchers have investigated the effec- 
tiveness of advance organizers, their findings have been largely 
inconsistent. More often than not, however, no advantage was 
found for groups who received an advance organizer (Barnes & 
Clawson, 1975), Several explanations could account for the 
apparent failure of advance organizer interventions. It is possible 
that organizers did not assist readers to relate new information to 
existing knowledge. This could occur if the organizer failed to tie 
together the two sets of information, or if the reader did not 
possess the presumed prior knowledge to which the organizer 
was addressed. Another possibility isNthat readers in these 
experiments already possessed readily avaih^ble schemata for the 
learning passages; and for them the advance organizer treatment 
was superfluous. Thus, the performance of an organizer 
group would not exceed that of a control group. Whatever the 
explanation, advance organizers, at least in their present state of 
definition, do not represent a high success intc^rvention for 
enhancing comprehension. \ 

^\ 

Titles and Paragraph Heads '^v^ 

Under certain conditions, for example, when asked to 
read hightyambiguous passages, mature readers' comprehension 
can be considerably influenced if they are given thematic sto^y 
titles (Bransford & McCarrell, 1975). Presumably the title offers 
a relevant framework with which to interpret or disambiguate the 
passages. 

Doctorow, Wittrock, and Marks (1978) extended the use 
of titles to unambiguous prose passages. Sixth grade students 
read passages that were accompanied by headings, which were 
high-frequency synonyms for words central to each paragraph's 
topic. These students answered 43 percent more comprehension 
questions correctly than did students who read the passages 
without the aid of the'"hea"dings.Tn"inteTpret"iTrg~t'h^^^^^ 
Doctorow et al. propose that paragraph headings function as 
retrieval cues which assist the reader in locating relevant 
memories, or background knowledge. Presumably, the retrieval 
of these memories allows the reader to construct meaningful 
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elaborations for the textual information. In tir'-n, these 
elaborations.enhance comprehension and memory for the text. 

These findings of Doctorow et al. are inconsistent with 
results of several other investigations. In other studies of 
children (Landry, 1967; Suavely, 1962), adolescents (Cole, 1977), 
and adults (Christensen & Stordahl, 1955), no advantage was 
'found for passages with paragraph headings. It is difficult to 
reconcile these conflicting results. Fortunately, the best designed 
experiment of the group is the one which reported compre- 
hension facilitation from the paragraph headings. It is also the 
only study which carefully defined the procedures for creatingthe 
thematic paragraph headings. The' sizable effects reported by 
Doctorow et al. warrant further research in this area. 

Pictures 

Pictures enhance comprehension in both reading and 
listening tasks (Levin & Lesgold, in press; Schallert, in press). 
Despite some evidence to the contrary (Samuels, 1 970), the data 
showing facilitation by pictures are remarkably consistent. In 
reviewing the literature on picture use, Schalleri concludes that 
pictures will be helpful: 

...when they illustrate information central to the text,. when they 
represent new content which is important to the overall message, and 
when they depict structural relationships mentioned in the text. In 
addition, pictures seem to have a specific effect which is localized 
mainly te» illustrate information 4nd which amounts to more than a 
- second rehearsal of- the text [in press]. ^ 

Moreover, the effects of pictures are reasonably robust, 
with facilitation observed across a wide age range, large social 
class differences, varying intelligence levels, varying lengths of 
passages, verbatim and paraphrase test items, and immediate and 
d eTay ed ' tes t s ■( L evi ri^& in press). 7 ' 

Do pictures enhance comprehension because they alter 
schema? We feel it is only partially correct to classify pictures as a 
background knowledge variable which affects comprehension. 
In some circumstances, pictures definitely provide a framework 
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for interpreting textual information. Bransford and Johnson's 
(1972) Modern Day Romeo experiment, in which the text is 
incomprehensible^without an accompanying picture, represents 
the most compelling demonstration of a picture functioning as a 
schematic, contextual variable. In other circumstances, pictures 
* may enhances comprehension because they provid'e different 
'and/ or better information than text. Kolers (1973) and Schallert 
(in press) both hypothesize. that compared to words, illustrations 
can better depict spatial and structural information, finally, in 
those circuYnstances where pictures have enhanced compre- 
hension despite the fact that the information to be derived from 
the picture was explicity stated in the text, the pictures may have 
modified the reader's attention and semantic processing of the 
text. For example, pictures may induce the reader to actively 
elaborate the text with mental imagery. 

Regardless of the reason for the facilitative effects of 
pictures, they appear to represent an instructional variable that a 
' curriculum developer or teacher could use to promote reading 
comprehension. We would argue, however, that pictures hj&ve 
rather circumscribed effects. Like advance organizers, titles, and 
subheadinjgs, they are an instructional variable appropriate to 
situations where the student is reading for new information as 
opposed to learning to read. There is no evidence that teachers 
can use pictures to make their students better comprehenders in 

. any general sense. Providing children with texts plus pictures is 
not likely to affect -.their subsequent success Ln comprehending, 

. other text without pictures. On the other hand, if background 
knowledge is as important to reading comprehension as- if 
appears to be (Anderson, 1977), then variables such as pictures 
and paragraph headings will impact, at least indirectly, on 
reading comprehension ability because the reader will have 
acquired broader and deeper background knowledge, having 
read materials which were accompanied by these aids. 

Most background knowledge variables, such as orga- 
nizers, titles, and pictures, do not represent skills which would be 
taught. Rather, they are adjuncts to specific passages that are 
supplied by authors and editors to help readers integrate the 
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passage information with their existing knowledge. A different 
kind of background knowledge variable is represented by rules or 
generalizations which affect 'the readers ability to 'interpret 
information from a variety of passages. A study by Clements, 
Stevens, Kameenui, and Carnine (Note 2) illustrates this kind of 
prior knowledge variable. In their study, they taught two groups 
of students to identify and interpret motives of story characters. 
Stories presented either an apparent motivation of a character or 
both an apparent and true character motivation which had to be 
inferred from explicit information given in the stories. In a Rule 
and Questioning treatment, students were told: 'There may be 
more than one reason why someone docs or says something." 
Students then orally read a story, hcv rd the rule again, and 
proceeded to answer a series of questions designed to lead them 
to infer the true motive of the character in the story. In a 
Corrective Feedback treatment, students read a story and 
answered a question about a character's true motive. If students' 
answers were incorrect or partially correct, the experimenter 
modeled the correct answer, but did not tell the. rationale or 
derivation of the answer. Subsequently students were given novel^ 
passages and requested to identify character motives. The two 
training groups' performance was equivalent, and superior to 
that of an untrained group. Apparently students were able to 
induce the generalization about real and apparent motives from 
mere corrections and did not require the explicit statement of a 
rule, nor the explanation of how the rule applied to specific 
.stories. This study suggests that certain general rules can be 
taught which help students interpret different aspects of text. 
These rules could be considered a type of high level advance 
organizer or schemata which are applicable to many different 
. texts. ' 

'nstructional Variables Addressing Linguistic I Reasoning 
Problems 

While the reader's prior knowledge certainly plays an 
important role in what will be learned from text, it is not the only 
^ variable. It stands to reason that aspects of the text itself, the 
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reader's ability to deal with various linguistic structures, and the 
ability to reason about and interpret different kinds of messages 
all can affect the degree to which a text will be^understood and 
remembered. In this section we will review attempts to provide 
the reader with skills needed to simplify difficult aspects of text. 
Included are rapid decoding, vocabulary, cloze, organizational 
strategies, specific subskills instruction, and auding-reading,- 

Rapid Decoding 

Most reading authorities agree that some level of 
decoding proficiency is necessary for adequate reading compre- 
hension. In his text on the teaching of reading, Harris (1970) 
states, "Some very slow readers do poorly in comprehension 
because their many repetitions and hesitations break up the 
continuity of thought" (p. 447). Consistentwith the notion that 
decoding speed is a factor which influences reading compre- 
-hension, Perfetti and Hogaboam (1975) found that good and 
poor comprehenders differ in the speed with which they decode 
single words. Based on these data, Perfetti (1977) proposed a 
shared capacity or "bottleneck" hypothesis to account for the 
relationship between decoding speed and compreheiision. The 
basic notion is that Individuals possess limited processing space, 
and that decoding 'and comprehension are separate but 
interrelated tasks that both require that space. The more 
processing space consumed by decoding, the less processing 
space available for comprehension; thus, inefficient decodingcan 
detract from comprehension. 

LaBerge and Samuels (1974) also argiie for the impor- 
tance/of decoding speed or automaticity. In their model, the 
development of reading skills is marked by an increase in 
automatic processing of print. Automatic processing which 
results from advanced familiarity with letters and words is 
characterized by rapid and accurate response to print, such that 
the reader does not expend attention on the task of decoding. A 
proficient reader automatically processes words and their 
meanings, thus releasing attention for comprehending. 
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The data, base for the presumed influence of decoding on 
coniprehension is essentially correlational and, as Perfetti (1977) 

/• has indicated, the basis for asserting the causal relationship, 
between decbding rates and comprehension is still conceptual,, 
not empirical. That rapid automatic decoding should facilitate 
comprehension has enormous face validity and this notion gains 
support from data showing that good comprehenders tend also 
to be rapid decoders. But, it does not necessarily follow that 
helping children to become rapid decoders will help their 
comprehension. Indeed, little prior research has been conducted 
with poor readers on the effects of training in rap^d decoding. 
Two studies (Dahl, 1979; Samuels, Dahl, & Archwamety, 1974) ' 
attempted to examine this iss,ue experimentally Their results 

- indicated that groups of students who had received speeded 
isolated word training performed no better on comprehension 

' tests than did untrained students. However, in. neither of. these 
studies did word drill produce effects on speed of word 
recognition; thus, failure to observe transfer effects to compre- 
hension should have come as no surprise. 

A stronger test of the rapid decoding hypothesis was made 
in two experiments by Fleisher, Jenkins, and Pany (1979). ■ 
Fourth and fifth grade students were classified as good and poor 
readers according to their reading .comprehension test scores. 
One-half of the poor readers received rapid decoding training on 
all words from a test passage. Word drill was continued until 
these poor readers were able to read ^11 words that would appear 
in the test passage at a Jcate equivalent to that achieved by good 
readers. Next, test passages were presented to good readers, to ^ 
poor readers who had received rapid decoding training, and to i 
poor readers who had received no training. After reading the j 
passages the students were given several comprehension / 
measures including questions, story retell, and cloze. Altho'ugh/ 
there were minor procedural differences in the two experiments,/ 
in both cases the results indicated that poor readers who had 
received rapid decoding training performed no better than their 
nontrained poor, reader counterparts. Both groups of poo|- 
, readers performed significantly worse than the good reade^r 
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group on'the .comprehension measures, even though the trained 
po.or readers read the passages more fluentlv than their untrained 
Counterparts, However, despi^' ' ^rainin^^ in rapid decoding, 
'poor readers still readi signifi i s fluently in context than 
the good reader control grou results'suggest that'sirigle 

word training does not autom transfer to comprehension, 

and does not produce high levels of fluency in context. 

Dahl (1979) reported more optimistic results in an 
• '.'cmely . we'll designed study testirig the rapid decoding 
'^tHhesis: rtfer study deserves a high mark on the dimension of 
: *ogical validity in that it was conducted in the context of a 
iourth grade classroom on a daily basis throughout an entire 
school year. To develop fluency, experimejital students practiced 
reading 100-Word passages, They'reread each passage until they 
could complete it in one niinute. Whenever a student achieved 
this goal on one practice passa.ge, that passage was replaced by 
another. At the end of the year, these children were compared 
with control children who had engaged in '"regular" reading 
instruction' f6r an "equivalent length of time: Dahl's results 
indicated that children who were given repeated reading practice 
(to develop autortiaticity) exhibited superior performance on 
cloze measures, on errors during oralj;eading, oh reading rate, 
and on isolated word recognition. Also, it appeared that these 
children achieved significantly higher scores on the Gates- 
MacGinitie reading test (another compreheiision measure), 
although it is impossible to be certain of this, since Dahl's report 
did not include statistical details. ^ • . 

Dahrs findings are clearly at variance with those- of 
Fleisher et al. There are several possible explanations for the 
discrepancies. Whereas the Fleisher et al. training procedure 
emphasized single-word^ decoding, Dahl's repeated reading 
procedure provided extensive practice, on speeded reading in 
context. This difference along with the duration of DahPs 
treatment may have enabled . her students to. improve not only 
their fluency but also their organizational skills, for example, 
they could better segment complex sentences, into meaningful 
units. Moreover, it is possible that students need to attain a level 
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of fluency and maintain it lor some period before they can take 
advantage of their newly gained processing resources and apply 
them to comprehension work. By themselves. Dahl's findings 
provide sufficient grounds for remaining attentive to decoding 
speed and accuracy when considering instructional interventions 
to improve reading comprehension. 



Vocabulary 

• The importance of word meaning knowledge to reading 
comprehension would seem to be self-evident. According to 
Spache (1966): ''Understanding the vocabulary is second only to 
the factor of r-asoning in the process of comprehension, and 
some writers would say it is even more important than 
reasoning.... It is sufficient to say that comprehension is 
significantly promoted '^y attention to vocabulary growth" 
(p.78). While the research on vocabulary is enormous (Dale, 
Razik, & Petty, 1973), it is largely descriptive. Unfortunately, 
there are relatively few studies which directly document the 
effects of vocabulary instruction on reading comprehension. 

In studies that have examined the effects of vocabulary 
instruction on general measures of reading comprehension <i.e., 
standardized tests), the findings have not been encouraging(e.g., 
Currie, 1963; Hafner, 1965; Lieberman, 1967; Otterman, 1955). 
Only Currie reported a significant comprehension - effect. 
Students who completed exercises in classifyingwords according 
to similarities of structure and meaning had hi<:her compre- 
hension scores than did control students. Teaching students 
vocabular:^ through context aids(Hafner, 1965), word-roots and 
prefix meanings (Otterman, 1955), or concept development 
(Lieberman, 1967) appeared not to affect readingcomprehension 
as measured by standardized tests. 

When researchers have studied the effects of vocabulary 
training that focused on specific words and specific passages, t:he 
results have, been' equally disappointing. In a series of 
experiments, Pany a'^d Jenkins (1978), Jenkins, Pany, and 
Schreck (1978), and Pany (1978) evaluated the effects of several 
vocabulary instruction procedures on a variety of measures. The 
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instructional procedures included drilling on synonyms, telling 
word meanings in the context of oral reading, r ' iting words to 
common experiences, and providing praictice in applying word 
meanings. Based on a number of vocabulary measures, 
synonym drill was consistently the most effective instructional 
"jpTocMuix^ 'The differe-n 

procedures on vocabulary acquisition were generally mirrored 
on measures of sentence comprehension (Jenkins et aL, 1978; 
Pany, 1978). In contrast, vocabulary training producea no 
discernable effects on the comprehension of passages coniammg 
the words, which had been taught whether comprehension was 
measured by questions, cloze, or retell. 

These results were partially supported in research 
conducted by Ktimeenui and Carnine (Note 3) who compared 
reading comprehension effects from two vocabulary training 
procedures. For one group, tnrning consisted of pracMce on 
synonym meanings and wort application, A second group 
received this training plus integration instruction in which iht 
student was stopped during oral reading and asked to generate 
meaning for those sentences which contained a newly loarncd 
vocabulary word. These training groups then read a test p.:ss;»;ge 
which contained the new vocabulary. Their comprehension vvas 
compared with a control group which read the same test passage. 
On literal comprehension questions there were no differences 
among groups, but on inferential questions the contro' gro'.p 
which had not received vocabulary t.. lining performed sig^.if- 
icantly worse than the training groups, whose perfoi iT^ai ce was 
equivalent. 

In summary, while a variety of methcjds are capable, of 
teaching vocabulary and of affecting. comprehension at ne 
sentence level it is not so easy to affect overall r^ad'ng 
comprehension. The effects of vocabulary knowledge on reading 
comprehension .seem to be far more subtle than matiy leading 
educators had imagined. . ,^ 

Cloze 

The cloze task has become a common measure of reading 
comprehension. In this task students are asked to supply words 
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that have been deleted from a passage. The number pf correctly 
supplied words serves as a reading comprehension score. Cloze 
presumably taps a student's i\hvMy to use available linguistic 
information to construct meaning ior a pa'-^age. The rationale 
for using cloze as an instructional activity is th.i: students become 
jsensitive..to the semantic and syntactic cuesjn texi and learn to_ 
"test" the sensibility of the text produced iV'^m their word 
substitutions. 

What are the effects of providing students with practice on 
cloze tasks? In the studies that have examined this procedure, few 
have noted improvements in reading comprehensi^'u. Several of 
these studies (Blumenfield & Miller, 1966;Guice, 1969; Heitzman 
& Bloomer, 1967) used older subjects and thus fall outside the 
scope of this report. However, two studies which used sixth and 
seventh grade youngsters ,outhan, 1965; Schneyer, 1965) also 
report nonsignificant comprehension differences between cloze- 
practice and control groups. Neither experiment used a cloze 
comprehension measure. In contrast to these findings, Kennedy 
and Weener (1973) reported enhanced comprehension after 
practice with cloze. They gave third grade remedial students cloze 
exercises which were presented either through a reading or 
listening mode. Compared to a non-instructed group, the cloze- 
practice group scored significantly higher on the Durrell 
Listening Test and on reading and listening cloze measures. On 
the Durr^U Reading Test, the cloze-trained group exceeded the 
performance of all other groups. What is remarkable about 
Kennedy and Weener's study is that any e - were found, given 
the limited amount of training. Cloze pr: :e consisted of only 
100 items and no more than 1 hour'and40 minutes of instruction. 

Samuels et al. (L974) developed an instructional proce- 
dure which is distinrf from other cloze research. They analyzed 
the skills involved in completing cloze ' exercises into seven 
components, including training in the use of auditory and 
visual context to predict words which could logically 
follow in a sentence. Samuels et al, termed this procedure 
''hypothesis/test training/' Groups trained in hypothesis/ 
test consistently outpei iormed control groups on cloze 
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measures of comprehension (Dahl, 1979; Samuels et al., 1974; 
Sindelar, Note 4). On stand: .dized Veading comprehension 
measures, Dahl reported differences which favored the'hypoth- 
esis/test group while Sindelar found no significant differences, 
Samuels et aL and Dahl attributed their findings of improved 
•comprehension to the;children Vine. oased automaticity, .They- 
believe that the children receiving hypothesis/ test training 
became adejpt at wok I recognition using partial rather than whole 
word cues. 

The suits of studies employing cloze as an instructional 
technique, while far froi^" consistent, suggest that certain uses of 
cloze may benefit* reading comprehension. The hypothesis/ test 
procedure some variation thereof, which involves training on 
the subskills which presumably contribute to cloze proficiency, 
may be the key to effect i^'e cloze instruction. 

Organizational Stra.tegies During Reading ; 

Research on readability indicates that a number of text 
characteristics influence comprehension (Bormuth, 1966), Sev- 
eral studies have evaluated the effects of syntactical and semantic 
factors on children's understanding of and memory for text 
(Bormuth; Manning, Carr, & Pearson, 1970; Lesgold, 1974; 
Pearson 1974-1975). However, research on training children. to , 
understand syntactical structures is sparse. 

V In one study, Weaver (1979) attempted to influence 
reading comprehension by teaching children to encode text in 
meaningful chunks. Shetraitied average and above average third 
grade readers to unscramble sentence anagrams as a means of. 
teaching children to chunk meaningful units within sentences. 
Her result.s were irnpressive; after an average of 3'/: hours of 
practice on sentence anagrams, experimental subjects signif- 
icantly outperformed untrained controls 6n speed and accuracy 
of unscrambling anagrams, on cloze paragraph performance, 
and 01. a prompted verbatim sentence recall test. Hdwever, 
difference^ were not found on the ot ly .standardized reading 
comprehe'Msion test used (mat Reading Subtest) and on a timed 
test iri!.'«'=i': in^ recognition of meaningful sentences. 
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Another training activity which is thought to facilitate 
"chunking" of information is practice in sentence combining. In 
general, sentence-combining exercises are designed to increase 
students' awareness of larger linguistic units and thus increase 
language fluency. In two experiments (Fisher, 1973; Hughes, 
1525)vSludents were given sets of sentences to combine in writing 
(e.g., "My brother tore up my book" and "My brother is little" 
results in "My little brother tore up my book"). After five hours 
of instruction, fifth, seventh, and ninth grade students outscored 
controls on the Stanford Paragraph Meaning subtest and a cloze 
test(Fisher, 1973). In contra Hughes (1975) provided sentence 
combining training for low, middle, and high ability seventh 
graders. After a minimum of 26 hours of practice, there were no 
significant differences between- experimental and control groups 
on a cloze test or the Gates-MacGinitie subtests for SpcoH and 
Accuracy; but on the Miscue Inventory (Goodman & Burke, 
1972). experimental groups outperformed controls on comprc- 
hensio 1 and grammatical strength. Hughes observed that the 
greatest gains in reading comprehension from sentence, com- 
bining occurred in the lower and middle ability groups. 

Research Qn teaching organizational strategies, although 
limited, holds promise for improving reading comprehension. 
Activities which enhance "chunking" of inforinat'ion appear to 
affect students' reading behavior. 

Specific Subskill Instruction 

While reading researchers debate the subskill vs. holistic 
nature of the reading process, daily reading instruction in schools 
■ is typically approached from the subskill perspective (Jenkins & 
Pany, in press). Commercial reading materials generally offer 
activities designed to teach such comprehension skills as "finding 
the main idea,'' "identifying the sequence," and "drawing 
inferences." "Instruction" is essentially a matter of discovery 
learning through workbook exercises and teacher questioning, 
rather than through direct rule learning, rule application, or task 
analytic procedures. Sey&ral studies have addressed the teaching 
of specific comprehension subskills, and these have been based 
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on a task analytic direct instruction approach. For example, 
Carnine arid his associates have used these procedures to teach a 
number of subskills ranging from simple to complex. These 
include'answering sequence and simple fact questions (Carnifte,, 
Prill, & Armstrong, Note 5), using context to derive the meaning 
of unknown words (Coyle, Kamecnui, & . Carnine, Note 6), 
comprehension of reversible passive voice and clause con- 
structions (Kameenui, Carnine, & Maggs, Note 7) and selecting 
information needed to form conclusions (Woolfson, Kameenui, 
& Carnine, Note 8). 

Another comprehension subskill is "critical reading," 
Robinson (1964, p. 3) offers a general definition of critical 
reading as "judgement of ihe veracity, validity, or worth of what 
is read, based on sound criteria, or standards developed through 
previous experience," Wolf, King, and Huck( 1968) report scores 
on the Ohio State University Critical ReadingTest which favored 
groups who for one year received lessons involving reading, 
discussing, and evaluating printed materials and completing- 
logic worksheets, Nardell^ (1957) reported that students who 
recei'-cd instruction made significantly greater gains on recog- 
nition of propaganda devices, but not on interpnMing authors' 
suggestions or .characters' feelings. Students ..iio used work- 
books and audio tapes to learn logic rules (Lowerre &Scandura, 
1973-74) exhibited significant growth on a test which measured 
ability to recognize valid inferences, to detect statements 
incompatible with the premises, and to detect conclusions that 
were /ot logically permitted by the premises. In three 
experiments- designed to teach children to contend with 
determinate and indeterminate syllogisms in text, Katzenmeycr 
and Van Blaricom (1976) report "... that although the results 
were statistically significant [on one of three experiments], there 
were no major improvements in scores due to instruction [on 
syllogisms]" (p7 5). 

I In summary, it appears that students can acquire specific 
comprehension subskills if they receive carefully desii,/>ed 
instruction. A question that is yet unanswered is whether mastery 
of specific subskills affects overall comprehension. None oi the 
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studies reported in this section offer evidence that children who 
receive instruction in a particular skill area demonstrate 
improved comprehension on noncontrived texts. Do stud .nis 
who learn to comprehend passive voice constructions in specific 
exercises apply that skill in more natural context? Do students 
who receive specific subskill training recognize propaganda or 
passive voice constructions or pi-dblem statements when they 
encounter them in other contexts? A serious void in the 
comprehension subskill research to date is the failure of 
researchers to assess transfer effects to other reading compre- 
hension skills and to more general measures of comprehension. 
While it appears that specific comprehension skills can be taught, 
how this accomplishment relates to more general reading 
comprehension ability has yet to be established, 

Auding-Reading Relations 

One basis for designing reading comprehension instruc- 
tional strategies is provided by the developmental model of 
auding and reading described in the work of Fries (1963) and 
Sticht and his colleagues (Sticht, Beck, Hauke, Kleiman, & 
James, 1974). According to this comjprehension model, the same 
language competencies and cognitive content which permit one 
to understand spoken messages (auding) are also responsible for 
the understanding of written messages (reading). Thus once u 
person can decode print with sufficient ai; :omaticity, his/her 
comprehension of text will be equal to that which would be 
achieved if the message were processed auditorally. 

In one sense, research on the auding-reading relationship 
falls outside the organizational frames :)rk of this chapter. In 
theory, all instructional variables could be examined in relation 
to the aud-read model. However, since auding-reading research 
has largely examined variables which ciMow the reader to simplify 
the linguistic input of passage, we are including it in this section. 

Sticht et al. (1974^ offer four lines of evidence which 
support predictions from their auding-reading model. First, the 
ability of children to comprehend spoken messages should 
exceed their ability to comprehend written messages during the 
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early school years, presumably because beginning readers lack 
competency in decoding printed language. As decoding skills 
develop over time, auding and reading abilities will become 
equal. In general, the existing evidence related to this prediction 
confirms that auding is initially superior to reading and that the 
two processes become equally effective sometime around 
seventh or eighth grade. Second, auding ability will predict 
reading comprehension, once decoding skills develop. Data 
relating to this prediction shows that the correlations between 
these abilities increase from first to fourth grades and remain 
stable (around +.60) thereafter. Moreover, upper elementary and 
junior high aged students who are poor readers also perform 
badly on listening tasks, suggesting a general coniprehension 
deficit rather than one specific to reading (Sm ley, Oakley, 
Worthen, Campione, & Brown 1977; Becker & !Jlly, Note 9). 
Third, if similar language and cognitive comretencies are 
responsible ir- both auding and reading, then similar rates of 
information input, whether by printed or spoken language, 
should yield similar levels of comprehension. Evidence offered 'n 
support of this prediction reveals that the maximal ;ates Tor silent 
reading with accurate retention are similar to rates of both 
auding and speaking (250-300 wds./min.). Fourth, and most 
cruc ial for instruction in comprehension, the model predicts that 
training which improves auding ability will be reflected in 
improved reading ability at least for thdsc individuals who can 
decode print. Of twelve studies, reviewed py Sticht et al. ; 1 974) in 
which specific auding abilities wereJiD|/roved through training, 
ten reported improved reading ability which paralleled the 
improvement in auding ability. Those studies which reported an 
auding reading transfer tended to include such content as 
*nraining in listening in order to recall events, ideas, or 
details . . .training in vocabulary . . .training in listening to predict 
outcomes or io draw conclusions or inferences . . .[and] training 
in listening to follow directions" (Sticht et aL, p. 88). 

Another study bearing on auding/ reading t.ansfer was 
reported by Kennedyand Weener(1973)and previous!} reviewed 
in our section on ''Cloze.'' Compared to a noninstructed control 
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group, children who underwent clo/.c auduig training scored 
significantly higher on the Durrell Listening Comprehension 
test, on a cloze listening measure, and on a v. lo/.e reading measure. 

Imphcations of the developmental model of auding and 
reading for reading comprehension instruction are noteworthy. 
Once decoding is mastered, improvements in reading compre- 
hension should be a function of impiovements in the reader's' 
language and cognitive competencies. Efforts to improve; 
language and cognitive competencies can occur in the context of 
reading, but are not necessarily limited to reading activities. It 
would seem that instructional activities which expand on the 
child's world knowledge and on his ability to extract meaning 
from various syntactical forms of discourse would produce 
reading comprehension improvements. The aud-read model 
implies, further, that specific language competencies of children 
should be assessed, so that instruction could be geared to 
appropriate targets. If the aud-read model is valid, two 
observations are germane. Some ''reading comprehension" 
problems shouJd conveniently disappear as soon as children 
achieve proficiency in decoding, at least in children who have an 
adequate knowledge and language base. Thus, the neces^'ty of 
producing proficient decoders is obvious. Second, although 
instruction designed to improve cognition and language abilities 
may begin early, it should not be expected to have maximum 
impact on reading comprehension until decoding skillS; are 
reasonably proficient. 

Instructional Variabics Addressing 
Attention Related Problems 

Another source of comprehension failure involves the 
attention a reader gives to the task. We would expect 
comprehension to suffer if students processed text superficially, 
attended to unimportant aspects of the task for example, 
pronunciation, or failed to self-monitor their understanding 
and memory for what is read. Instructional variables which seem 
to addrc:s tlvj.^c attention related problems include questions, 
purpose settini:, incentives, and reading strategies. 
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Questions and Purpose Setting 

Establishing a purpose for reading and asking children 
questions about what they read are two frequently recommended 
' instructional procedures (Wcintraub, 1969), In a recent investi- 
gation of classroom reading instruction. Durkin (1978) found 
that teachers dcv.)ted the largest time allotment (18 percent) of 
the reading period to comprehension assessment, that is asking 
questions vbovi what was read. To our knowledge, there have 
been no systematic investigations of' the effects of teacher 
questioning on the tlcvdopment of children s comprehension. In 
contrast, there is an enormous literature on the effects of adjunct 
questions. in text and, there is some research on purpose setting. 
While most of this research has focused on mature readers, e.g.. 
college students, se veral recent • studies have used younger 
students. 

Providing children with purpose statements such as "read 
for implied meanings" or "read to find out how. . before they 
read a passage does not appear to facilitate comprehcn.sion 
(Ballard, 1965; Pettit. 1971: Snavely, 1962). Similarly, placing 
, questions before a reading passage does not consistently increase 
comprehension relative to no-question conditions (Fincke, 1968; 
Landry. 1967). In contrast, interspersed questions, either before 
or after pertinent passage segments, have sometimes been shown 
to aid children's subsequent test performance. These effects, 
however, are by no means consistent across studies. For example, 
Daugherty ( 1971) found that interspersed post-questions did not 
facilitate comprehension, while Swcnson and Kulhavy (1974) 
and Yost, Avila, and Vexler(l977) found that interspersed post- 
questions strongly facilitated comprehension. Swenson and 
Kulhavy also reported significantly higher | crformancc on 
relevant items (posttest information that was questioned during 
reading) vs. incidental items (posttest information not ques- 
tioned during reading). Richmond (1976) reports results that fall 
somewhere in the middle of these conflicting findings. His studv 
examined the generality of question effects' across reading 
selections. Three different passages were examined under two 
interspcrsed-question conditions (before and after) '^^c .i no- 
question control. In contrast to Swcnson and Kulhavy ana V ost 
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solutions usually involve teaching low performing students some 
skill or strategy or providing them with relevant background 
knowledge. For some children, however, poor cpmprehension 
may not ba.the result of poor instruction but rather lack of 
motivation' Several studies conducted in scho9l settings with 
normal reading materials have investigated / the effects of 
contingency management procedures. 

Lahey, McNees, and Brown (1973) and Jenkins, Barksdale, 
and Clinton (1978) report significant improvements in the 
number of correctly answered comprehension questions when 
such activity was reinforced by social praise and money, Lovitt 
and Hansen (1976) and Hansen and Lovitt (1976) have also 
modified reading comprehension with contingency management 
procedure^'. 1 n one study, permission to skip stories in the reading 
book was contingent upon a prescribed level of comprehension 
p. ^rmance. If the children performed beneath the prescribed 
i^^ ^hcy were required to reread the passages and correct their 
. J, When these contingencies were in effect, reading 
, rehension improved. 

This set of studies suggests that comprehension perform- 
ance can be altered through contingency management. Like 
adjunct questions, incentives have uncertain direct and indirect 
effects on reading comprehension ability. Our comments on this 
issue concerning questions and purpose setting are equally 
relevant here. 

Reading Strategies 

Whereas several studies have demonstrated that incen- 
tives and adjunct questions enhance learning from text, these are 
conditions imposed on the reader by an outside agent, usually a 
teacher or a curriculum writer. Itwould behearteningtodiscover 
that children could learn general strategies to apply by 
themselves thus reducing their dependence on outside agents. 
Among the attempts to teach students an internalized reading 
strategy^weJBcLude th e resear ch on imagery, paraphrasing, self- 
checking, reading strategy lesson^and corrective feedback. 

Pressley(1977) has reviewed rese^arch on induced imagery 
and prose learning. The basic paradigm of thi^Tesearch involves 
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instructing students in how to form mental images of the events 
specified by the text. The comprehension of students 
receiving this instruction is then compared to that of students 
who arc not so instructed. There appears to be a clear 
. developmental trend in children's ability to use. mental imagery 
\ with discourse materials. Children below the fourth grade usually 
\io not bejiefit from an instruction to form images, cither on 
liklening or reading tasks. But in one study Levin and Divmc- 
Hii^kins (1974) found that, compared to a no-imagery group, 
fourtl>grade students who were trld to use an imagery strategy m 
a li.sienim^i^sk scored higher on a recall measure. However, no 
comprehension improvements were noted for students who read 
the ."iame story with imagery instructions. 

Whereas younger students have difficulty ernploymg 
imagery, older students ( fifth and sixth graders) have been shown 
to benefit' from imagery instruction in school-like reading tasks 
(Kulhavy & Swenson, 1975; Rasco, Tcnneyson, & Boutwell, 
1975). However, younger children (e.g., third and fourth graders) 
who ordinarily do not be.nefit from an instruction to image may 
be taught an imagery strategy. Pressley ( 1977) gave, students 20 
minutes of imagery practice . with feedback regardmg the 
adequacy of their imaging. In artpther study, Lesgold, McCormicl , 
and Golinkoff ( 1975) trained children over several weeks to draw 
pictures of events described in passages and later to picture the, 
stories in their heads; The results of both studies showerf. 
improved comprehension for the imagery-trained subjects. 
However, in the Lesgold et al. study, students did not 
/"^spontaneously employ the imagery strategy, but had to be 
' reminded to use it. Moreover, there is evidence from this study 
and from a study mentioned- by Levin (1973) that an imagery 
strategy will produce improved performance only on certain 
■ kinds of reading selections, specifically those containing a large 
number of concrete references. 

These results raise questions about the general utility ol 
' imagery strat(^ies. As Pressley ( 1977) notes, even in the studies 
■ which report facilitation from induced imagery, the size of the 
. effects is'rather small. Moreover, there appear to be a number of 
individual difference variables which hinder some students' 



188 



Jenkins and Pany 



successful employment of an imagery strategy with prose, 
including mental retardation (Bender & Levin, 1978), decoding 
deficiencies (Levin, 1973), and ability to learn from pictures 
(Levin, Divine-Hawkins, Kcrst, & Gut-tman,' 1974). 

If learner and text characteristics exert serious limitations 
on the effectiveness of imagery strategies, are there other means 
by which readers might successfully create elaborations of text? 
Doctorow et al. (1978) devised a procedure for text elaboration 
which would seem not to depend upon the concreteness of text, 
rior on students' ability to learn from pictures. In their study, 
sixth grade students were instructed "to generate and to write 
their own sentence about what happened in the paragraph after 
they read each paragraph of the story/' Even though students 
received no practice or feedback regarding the adequacy of their 
sentence paraphrase, they apparently succeeded in performing 
this task. Compared to a reading-only control group, students in 
the paraphrase treatment correctly answered 43 percent more 
questions on a posttest and also completed 43 percent morecloxe 
items one week later. In a replication with lower ability students, 
the treatment appeared to produce even larger effects with 
performance improvements of 67 percent. Such promising 
results from so simple a manipulation call for replicati.on and 
extension to other reader groups and reading selections. 
Together with the research on imagery, the Doctorow et al. 
findings suggest that readers can improve their memory for text 
by producing mental or verbal elaborations of the events 
depicted in a passage. How the imagery/sentence generation 
strategy enhances comprehension or 'memory for text is not 
altogether clear. The strategies may be effective because they 
raise and maintain the reader's attention: A Iternatively, they may 
ensure deeper semantic processing of the information, or may 
require students to organize and relate the information in the 
text data to their existing knowledge. 

Teaching young readers how and when to s>^tematically 
employ various reading strategies may affect thei> success in 
certain specific reading -situations, for example, with highly 
concrete text. More generally, it would also seem to make them 
more independent and self-directed learners and more knowl- 
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edgeable about tHeir own comprehension processes. Brown (in 
press), describeSHthis phenomenon as the development of 
metacomprehension^'She has identified several skills which 
mature learners, those who are knowledgeable about their own :■■ 
comprehension and memory abilities, seem to use. In a recent 
study. Brown, Campionc and Barclay ( 1978) report an attempt 
to teach a self-checking skill to mentally retarded adolescents who 
normally perform poorly on memory and reading tasks. Students 
learned a rehearsal strategy for memorizing list information, NoK^ 
only did these students retain this self-checking, rehearsal 
strategy .over a long time, they also spontaneously generalized the 
strategy to prose materials. 

While Brown and Campione focused on what may be seen 
more as a study skill than a reading skill, the distinction between 
these areas is sometimes blurred. For instance, are imagery and 
sentence elaboration strategies reading or study skills? One might 
argue that in contrast to these reading/ study strategies, teaching 
children a general "reading for meaning" -orientation is a purer 
"reading" strategy. The work of Goodman (1967), a leading 
spokesman for a meaning emphasis as opposed to a skills and 
decoding emphasis in the teaching of reading, has prompted 
several studies on the feedback teachers give to students. In this, 
research, teacher-delivered corrections for oral reading errors 
either emphasized correct pronunciation and grapheme-phoneme 
reproduction or emphasized readingfor sense. In the former case 
a teacher might stop a student for every oral reading error, no 
matter how minor, and require the student to "sound-out" the 
correction. In the latter case, a teacher might tolerate and even 
encourage oral reading errors as long as the text modifications 
made sense and did not alter the author's intended meaning; if 
teachers did correct errors, they would encourage the student to 
supply a word which was consistent with the syntax and 
semantics of the sentence or witfi the overall passage. 

In gene--.i, this research suggests that the type of 
corrections teachers provide can shape students toward either a 
decoding orientation or a meaning orientation (Hansen,, 19.77;' 
Piper, 1975; Smith, 1974). However, most of these studies are 
guasi-experiments and, thus, the strength of their results- is 
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somewhat attenuated. In contrast, Niles, Graham, and Winstead 
(Note 10) recently reported a true experiment in which they 
contrasted code-oriented corrections of oral reading miscues 
with a "no-correction" condition. After only four days, fourth 
grade students appeared to read differently as a function of these 
feedback variations. On an orally read test passage, students in 
the no-correction condition made significantly fewer meaning 
change miscues and also exhibited higher retell scores. Pany, 
McCoy, and Peters (Note 11) conducted a study of the same 
instructional conditions as Niles et aL, but obtained different 
results. Remedial reading students assigned to the corrective 
feedback and no-correction conditions did not differ m 
answering story-referenced comprehension questions. However, 
higher functioning remedial students obtained superior retell 
scores under the no feedback condition. 

The findings of Niles et al. are further attenuated by 
results of an experiment reported by Fleisher (1979). She found 
no differences on comprehension measures of students taught 
under the following conditions: 1) corrective feedback for each 
oral reading error; 2) no corrections during reading, but feedback . 
on answers to comprehension questions; and 3) a combination of 
corrective feedback on' oral reading errors ^///r/ comprehension 
questions. 

Given these conflicting yet intriguing findings, the issue of 
reading strategies and teacher influence on students' strategy 
development is an area which deserves far more attention from 
researchers. 

The Current Status of Reading Comprehension Instruction 
' We have attempted with this review to characterize the 
research basis for various instructional practices in the teaching 
of reading'comprehehsion. Given this goal, one might expect us 
to be in a position to provide some guidance to teachers and 
curriculum writers about how they should proceed in designing 
and implementing instruction that will enhance t]ie development 
of reading comprehension. In our opinion, the current level of 
knowledge on this topic does not justify many strong prescriptive 
statements. 
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A somewhat more optimistic appraisal can be made by 
focusing on findings from specific studies. While they may not 
permit our making strong directives to practitioners, certain 
findings at least provide promising leads for subsequent 
investigations. In - the ^category of background knowledge 
variables, the research on pictures (Schallcrt, in press) may have 
direct implications for instructional design, and the research on 
paragraph headings (Doctorow et aL, 1978) would appear to 
warrant more attention from researchers, In„the category of 
linguistic/ reasoning variables the research by Dahl (1979) on 
repeated readings and hypothesis-test training is noteworthy. 
Weaver's (1979) "chunking" trainingand the sentence-combining 
exercises may be effective in developing syntactic knowledge. 
The task analysis and direct instruction approach represented by 
the research of Carnine (Note 5) and his colleagues should be of 
significant interest to teachers and curriculum writers, because 
this approach appears to succeed in teaching the very skills that 
are often the target of classroom instruction. Finally, within the 
category of attentional/^processing variables, the research on 
paraphrasing (Doctorow et al,, 1978) seems to hold the greatest 
promise. 

On the other hand, the research on cloze practice, advance 
organizers, purpose setting, speed of single word decoding and 
irr.agery suggests that these variables either do not enhance 
comprehension.to any detectable degree, or do so in extremely 
limited situations (cf. our discussion of imagery),' 

Stepping back from specific findings to a more general 
overview, we see that while the literature on comprehension 
instruction does not answer the big question on how to make 
children better reading comprehenders, it does raise several 
interesting questions. With the growth in schema-theoretical 
explanations of comprehension, we need to ask how the notions 
of world knowledge and schema can be translated into 
instructionally useful concepts. Recognizing that some first 
attempts such as advance, organizers have not been successful, 
researchers might rethink the procedures by which educators can 
identify particular abstract schema or general frameworks which 
would then be directly and deliberately incorporate^ into the 
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curriculum. Another relevant area, metacognition, involves 
strategic behavior for reading situations involving relatively or 
. totally unfamiliar topics, where obvious schemata are absent. 
Possibly strategies that involve deliberate semantic processing, 
such as paraphrasing, along with other strategies more directly 
related to remembering, such as systematic rehearsal, are most 
appropriate in situations where background knowledge is 
lacking for a particular"topic-(Pace, Note 12). 

There are other questions which have almost no 
relationship to theory. Nevertheless, research on these issues 
might yield some very useful information for practitioners For 
example, it would be of interest; to examine the effects of the 
multitude of workbook exercises that are assigned to children m 
the name of comprehension instruction. Do any or all of these 
different exercises teach anything and, if so, is that "anything 
related to reading comprehension? Some very practical experi- 
ments could be designed to address'this issue, and their results 
■ could have obvious and direct implications for classroom 

practice. ^ r ■ . ..^ 

Another recommendation we make for future mstruc- 

tional research is related to the choice of dependent measures. 

Many of the studies reviewed' above describe tbeir outcome 

measures in very general terms, for example, as factual questions 

and inference questions. The failure to employ umform measures 

may in part, account for inconsistent findings across studies. It 

might be wise to consider also incorporating multiple measures 

of comprehension into any studies of instructional .variables. If 

multiple measures are not included, the researcher may 

erroneously conclude either that the instructional treatment did 

not affect reading comprehension when indeed it did, or that the 

instructional treatment had widespread impact on reading 

comprehension, when in fact it did not. 

One final comment. We were struck by the increasing 

volume of research on reading comprehension instruction. 

However, this research is distinguished more by its breadth (a 

great diversity of variables are represented) than by its depth. 

That is relatively few variables have been studied on more than 

one occasion by more than one investigator. It is regrettable that 
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some of the most promising variables, in terms of their apparent 
effects, have been studied on only a single occasion (e.g., Dahl, 
1979; Doctorow et al, 1978; Weaver, 1979), The implications 
of this observation are obvious. Researchers in reading need to 
establish strong, credible, and generalizable findings if teachers 
are to be helped in their attempts to foster children's reading 
development. 
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Academic Learning Time and Reading 
Achievement^ 



David C. Berliner 
University of Arizona 

If one is willing to accept the standardized reading achievement 
test score as a criterion for effective instruction, then there is now 
sufficient evidence to say that some of t|\e variables associated 
with successful instruction in reading at the elementary grades 
are known. To interpret this evidence requires familiarity with 
just a handful of concepts: allocated time, engaged time, success 
rate, academic learning time, opportunity to learn, content 
coverage, curriculum-test congruence, and direct instruction. 
These conceptsj/as they apply to elementary grade rea-iwu^ 
instruction, arex^^scribed in more detail below. 

Allocated' Time 

Table 1 presents data describing time allocations in 
reading and language arts activities from 25 second grade and 21 
fifth grade classes. (The complete study, including descriptions cf 
the sample, instrumentation, and data analysis, is reported in 
Fisher, Filby, Marliave, Cahen, Dishaw, Moore, & Berliner, 
1978b.) These data are based on records of classroom'activity 
kept for most of the school days from January to May of a recent 
school, year in a sample of schools from the San Francisco, 

>This paper is based, in pari on the results of a mulli-ycar study called the Beginning 
Teacher Evaluation Stiidv. The study and papers issued from the project were a joint 
effort of David C. Berliner. Charles W. Rshc-r. Nikola N. Filby. Richard Marliave. 
Leonard S. Cahen. Marilyn Dishaw.-and J.effry E. Moore. 
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California, Bay area (Fisher, Filby, & Marliave, 1977). What is 
immediately obvious from these data is the enormous variability 
of allocated time for reading and reading related activities. In the , 
second grade, in classroom 4, students are provided with a mean 
of 47 minutes a day for reading. In contrast, similar students in 
classrooms Hand 17 each received an average daily allocation of 
1 18 minutes -for reading. The same .phenomenon is found in the 
fifth grade data. Students in classroom 6 received a little over an 
hour a day for reading, while in classroom 10 and classroom 12 
the average student is exposed to 2 hours and 17 minutes of 
reading instruction and related reading activities. These very 
large differences in the duration of reading instruction in 
different classes are great enough to affect scores on standardized 
reading tests (Fisher et al., 1978b). It is difficult to. determine' 
what the "proper" or "upper" limits of allocated time for reading 
should be in the elementary grades. But it is nut difficult to 
believe, from these data, that some teachers have allocated too 
little time for reading instruction. 

Another item of interest in the data of Table 1 is the 
standard deviation of daily allocated time in reading. In the fifth 
grade classes, class 8 and class 18 had almost identical means for 
allocated reading time, but they show quite different standard 
deviations. The managerial behaviqr of the teacher in classroom 
1 8-was such that very little variability from a set routine is shown. 
In class 8, m'uch more variability in the daily p^attern of classroom 
organization is shown. Such differences in style of classroom 
management also affect achievement. The large between-class 
differences in standard deviations and total allocated time 
sometimes occur because.transition time (the time students spend 
.finishing one activity, moving, and getting ready for another 
activity), wait time (the time students spend waiting for help or 
directions), and time spent in behavioral management in some 
classes is very high or variable from day to day. These data 
indicate -that the time allocated for academic instruction in a 
school day can easily slip away when a4eacher cannot keep the 
transitional time, wait time, and behavioral problems to a 
minimum. Any sensible manager know? that. Somehow, 
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Table 1 

Mean Time and Standard Deviation (in Minutes) Devoted to Reading 

and Language Arts Activi ties by Class and by Grade 

Fifth Grade 

Class Time 

1 102 (3) 

3 80 (2) 

4 92 (4) 

5 110 (8) 

6 68 (7) 
. 8 .104(13) 

9 127 (4) 

10 137(11) 

11 119 (5) 

12 ' 137 (9) 
14 • 130 (4) 

16 95 (8) 

17 108 (15) 

18 106 (2) 

19 108 (6) 
21 129 (9) 

23 ^ 121 v'6) 

24 95 (10) 

25 129 (3V' 

26 102 (8) 

27 * 88 (3) 



Second Grade 


V^laSS 


Time 


I 


L 106 (3) 


2 


79 (8) 


3 


(4) 


4 


47 (8) 


5 


96 (5) 


6 


65 (5) 


7 


93 (7) 


8 


75 (6) 


9 


90 (1) 


10 


93 (6) 


.11 


118 (7) 


12 


83 (5) 


13 


87 (1) 


14 


103 (6) 


15 


96 (3) 


16 


66 (3) 


17 


118 (3) 


18 


101 (4) 


19 


90 (2) 


20 


80 (4) 


21 


82 (4) 


22 


86 (6) 


23 


85 (6) 


24 


90 (2) 


25 ' 


78 (5) 



Sample 



Source: Fisher. Filby. & Marliavc. 1977. 



1Vjj.,i, - Total 

88(16) Sample ^4 (11) 



however, in many classes, there is a lack of attention to classroom 
management that ^results in considerable inefficiency, and 
reduced achievement on st£.idardized tests of reading. 

When studying instructional time it is also important to 
know how much time is allocated to>articular areas of the 
reading curriculum. Table 2 presents. such data for fdur fifth 
grade classes. These data were collected from teacher logs kept on 
a sub-sample of students in the class, over about 90 days of 
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instruction. Considerable variability in the allocations of time to 
particular content areas of fifth grade reading is shown. (Similar 

: results were found at other grade levels and for other subject 

[ matter.) 

Different philosophies of education result in, different 
beliefs about what is important for students to learn. These 

.beliefs, along with the teacher's likes and dislikes for teaching 
certain areas, result in some interesting differences in the 

///m7/o/7a/ curriculum of a class. For example, from Table 2 it can 

>e seen that classroom C spends dramatically more time on 
comprehension in reading than any of the other three fifth grade 
classes. In classroom D silent reading and spelling were 
emphasized, as judged from the dramatically greater allocation 
of time to those content areas, in contrast to the ayerage amount 
of time each student of classes A, B, and C received. And oral 
reading hardly seemed to be of interest to theteacher of class B, at 
least that is what can be concluded when the data from class B are 
compared with the data from the other fifth grade classes. 

These rather significant differences .in the functional 
classroom- curriculum do result in considerable differences in 
achievement (Fisher et al., 1978b). If students in these fifth grade 
classes were part of some end-of-year statewide testing program, 

' where drawing inferences from paragraphs of prose was tested, as 
it often is, one might well expect that students in classroom C 
would show superior performance when contrasted to similar 
students in the other fifth gra^e classes. 

This brief examination of selected data presenting 
estimates of classroom allocated time shows ^clearly that some 
teachers spend considerably more time instructing in particular 
reading content areas than other teachers, and some teachers 
allocate considerably more total instructional time to reading 
than do other teachers. These differences, pot into experimental 
terminology, represent clear differences in the type and in the 
duration-of treatment. And we should expect that when type of 
treatment and duration of treatment are varied, achievement will 
vary. Our data confirm this. Other things being equal, the more 
time allocated to a content area of reading, the higher the 
academic achievement in that content area. 
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Table 2 

Allocated Pupil Time (in Minutes) in Content Areas of Readmg for 
Four Fifth-Grade Classes 



Word 
struc- 
ture 



Curriculum Content Areas 



Classes 
B C 



Root words and affixes 
Syllables 



250 
67 



112 
60 



126 
102 



D' 



103 
212 



Synonyms 
Word Pronoun reference 
meaning Other word meaning 



95 
0 

558 



152 
0 

949 



10 
9 

1042 



^ Verbatim (no rephrasing) 
Translation (paraphrase) 
Compre- Inference/ Synthesis 
hension Identifying main items 

Evaluation of fact and opinion 
Other comprehension 



206 
122 
235 
153 
5 
196 



329 
151 
252 
243 
0 

325 



188 
1649 
1432 

943 
66 
1368 



119 
56 
6!5 



325 
383 
306 
326 
56 
239 



Oral reading 
Reading Silent reading 
practice Rea(iing in content areas 



Related 
reading 
activi- 
ties 



604 63 
1083 724 
505 256 



885 
956 
400 



Spelling 
Grammar 
Creative writing 
Study skills 
Other 



694' 
243 
56 
472 
207 



847 
183 
343 
669 
687 



664 
859 
98 
270 
1317 



Sources: Dishaw, 1977a; Dishaw, 1977b; Filby & Mnrlia\e. 1977. 



305 
3640 

284 



1415 
413 
573 
171 
426 



Engaged Time 

Table 3 presents data on. the average percentage of time 
students are engaged during reading instruction in the four fifth 
grade classes described in Table 2. These data are from observer 
records and not from teacher logs. Previous work revealed that 
teachers can keep accurate records of allocated time, but that 
classroom observers are necessary to obtain accurate records of 
engaged time (Marliave, Fisher, Filby, 1977). In examining these 
data it appears that the percentage of tinr?e students are engaged is 
relatively high. This is an artifact of the observational system that 
was in use. The observation system required that transition time 
and certain other classroom phenorhena be coded as separate 
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events. Thus, the data on engagement rates are for the time spent 
in reading, after a class has settled down and before the class 
starts to put their work away. If enj;agement were coded for the 
entire time block denoted by teachers as reading time, the 
engaged time rates would be considerably lower because during 
transitions or when waiting for help students are usually not 
engaged. Still, variability between classes is noted for this 
important variable. The engorgement rates in these four fifth- 
grade classes vary from 75 percent to 84 percent during reading 
instruction. This rang;:; was much larger in the total sample of 
classes studied. 

The average number. of minutes per day allocated for 
instruction, multiplied by the engagement rate, provides liberal 
estimates of the number of engaged minutes per day, per student. 
These data are found in Table 3. The range in these four classes is 
between 48 and 119 minutes per day of engaged time. These are 
dramatic differences, differences of lOO percent or more, in the 
engaged time students allot to learn their reading. And these 
differences in engagement have been consistently related to 
d ifferences in achievement. There is nothing very startling here. If 
stud'^nts do not pay attention, they do not learn much. 

In mosii: districts we may assume that a .school year is 
about 180 d-ays. This figure must be reduced by absences of 
teachers and students, strikes, bussing difficulties, the diffi- 
culties of instruction before. Christmas and Easter breaks, the 
testing at the beginning and end of the school year, and other 
factors. A reasonable, perhaps even a liberal, estimate of the 
"functional" school year may be about 150 days. Accumuhiting 
the engaged minutes per day over these 1 50 days gives an estimate 
of the engaged instructional time allotted by students to the 
academic curriculum during the entire school year. Table 3 also 
presents these data. In the four fifth.-grade classes, with 
reasonably mature and independent learners, between 120 and 
298 cumulative hours per school year are noted for all areas of 
reading. It is worth noting that these teachers were volunteers 
who were open enough about their teaching to permit regular and 
extensive observation of their classes. Thus, we estimate that the 
data on engaged time are markedly higher in the sample than in 
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Table 3 



Engaged Time, Percent of Time Students Are Working with Easy^Material 
and Academic Learning Time for Four Fifth-Grade Reading Classes 



Classes 



Percent of time students engaged 
Engaged minutes per day 
Percent of time students are in material of 
easy difficulty level 

Academic learning time per day in minutes 
Engaged hours per 150 day school year 
Academic learning time, in hours, per 150 
day school year ' 



A 


B 


C 


D 


82 


77 


^ 84 


75 


48 


51 


119 


101 


5! 


61 


47 


58 


24 . 


31 


56 


59 


120 


128 


298^ 


283 


60 


78 


140 


148 



the population of interest. We have reason to suspect that in 
many fifth grade classes cumulative engaged time in reading is 
well under 100 hours for the entire school year. With younger 
children, say .econd graders, the total time students are engaged 
in the reading curriculum for the entire school year is 
considerably lower, and just as variable. We have reason to 
suspect that in many second grade classes cumulative engaged 
time in reading is well under 70 hours for the entire school year. 

As t'hese data come to light some important questions 
must be asked. For example, what should be expected in the way 
of engaged time for 30 students and one teacher, working 
together throughout the school year? What are the expectations 
for instructional time held by parents and'school board members 
as they make policy to educate the young of a community? 
Because these new estimates of classroom allocated and engaged 
time do not conform to the prevailing beliefs that exist among 
people who manage and support education, either those beliefs 
must be. changed or instructional practices must be altered. 

Success Rate 

Three rather broad categories were used in the Beginning 

■' Teacher Evaluation Study to define the difficulty level of the 
material or activities that were worked on by students (Fisher et 
al 1978b). In "high success," the student understands the task 
and makes only occasional careless errors. In "medium success," 
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the student has partial understanding but makes some substan- 
tive errors. 1 n "low success," the student does not understand the 
task at all. These categories coincide with common sense notions 
of "easy," "medium," and "hard." 

High success rate. The findings consistently point out the 
positive effects of school tasks yielding high success rates (easy 
materials, providing a low error rate). Other research on 
instructional design has stressed the importance of high success 
rates. High success rate in scholastic activities has also been 
found to be one factor that contributes to high levels of student 
self-esteem. 

The average student in the Beginning Teacher Evaluation 
Study spent about half the time working on tasks that provided 
high success. Students who spent more time than the average in 
high success activities had higher achievement scores in the 
spring, better retention of learning over the summer, and more 
positive attitudes toward school. From these data, one might 
recommend that students spend somewhat more than half their 
time on tasks they can carry out with high success. Sixty or 
seventy percent might be reasonable. 

The idea of success rate may be more understandable if 
one thinks about the cyclical nature of learning. Learning is 
a process of moving from not knowing to knowing. Most likely, 
when new material is introduced the student will not understand 
completely and will make some errors. Guided practice and/ or 
explanation help the student understand, and she or he comes to 
make fewer errors. Eventually, the student will perform 
correctly, although probably with some effort. Learning will 
become well established and further work will be practice or 
review. This stage could be viewed as one of consolidation. At 
some later point, the student knows the material so well that 
further practice is of-minimal value and it is time to move onto 
something,new. The results of the Beginning Teacher Evaluation 
Study suggest that for learning of basic skills in the elementary 
grades, the stage of successful practice (consolidation) is 
particularly important, so that concepts and procedures are^ 
thoroughly mastered. Apparently some teachers do not devote 
sufficient time to this stage. . 
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\ While these data lead one to emphasize the importance of 
giving students ample opportunity for successful practice, one 
must also point out that it would not be desirable for students to 
spenci all of their time on tasks they can perform completely ., 
correctly. Common sense suggests that too high a rate of ^*high 
success" work would-be deleterious (boring, repetitive, time 
wasting, etc.). Probably, some balance between "high success'* 
and/more challenging work is appropriate. Also, it was found 
that older students and/ or students who were generally skilled at 
sch<)ol learning benefited from a smaller percentage of time at the 
high success level. Apparently these students had learned to 
problem solve, to take a task they did not completely understand 
arid work it out. These students may enjoy the challenge of more 
difficult materials, as longaUhey eventually experience success. 
Lo\v success rate. When students worked with materials 
' or "activities that were categorized as "hard," yielding a low 
success rate, achievement was lower. In the Beginning Teacher 
Evaluation Study, no teacher assigned a high proportion of 
materials that were exceptionally hard for students. However, 
some students worked on materials judged to be excessively 
difficult for them as much as 20 percent of the time. Other 
students never worked at a low success rate. Students who wecc 
observed to spend more time on excessively difficult materials 
generally learned less than other students. It is seldom, if ever, 
desirable for students to be given tasks where they experierice low 
success. '-^ ^ 

Academic Learning Time / 

Academic Learning Time ( ai.t) was the research variable 
of most interest in the Beginninj^Teacher Evaluati6n Study. This* 
variable is defined as the time a student is epgage^ with academic 
materials or activities that yield a high ^liccess rate. One 
component of alt is the engagement rate of>tudents. Another is 
the level of^difficulty of the material that is attended to by a 
student. Theoretical and empirical evidence suggests that 
classroom learning occurs primarily witli materials that are of an 
easy level of difficulty. Materials that' are too hard for a student 
do not add much to his or her acquisition of the concepts, skills. 
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and operations that are required of students in a particular grade 
level Nor do they allow for practice, repetition, and over- 
learning. These are important concerns if retention is to be 
maximized. Table 3 pre.>;ents information on the peicentage of 
time that students are working with relatively easy material. 
These data are^atings made by observers in classrooms. As 
shown in Table 3, for fifth-grade reading the range is between 47 
percent and 61 percent. Multiplying the engaged minutes perday 
by the percent of time students are assigned work that yields low 
error rates provides an estimate of alt per day. These data are 
also provided in Table 3. 

As noted above, the typical academic school year ol 18U 
days may be considered to be a functional school year of 150 
•days. The last line in Table 3 presents academic learning time, in 
hours for a school year of 150 days, in fifth grade reading the ^ 
range'is from 60 hours per school.year to 148 hours per school/ 
year in these four classes, differences of many hundreds of 
percent in accumulated *^*ai.t are noted. In the total samfile 
studied, the range of .ai.t is considerably larger, it should again be 
noted that all the elementary school teachers in this saniple were 
\'olunteers. These data, if they could be obtained from a non- 
volunteer sample, would most likely show even more between 

class variability. / . . 

1 f academic learning time is a major factor in acquiring the 
knowledge and skill required to master the curriculum of a 
particular grade level, for a particular content area, one can see 
that the school year does not contain as/much alt as might be 
desired If our concerns about instructibn are correct, there are 
many many classes where there is not/sufficient time for students 
to* master the curriculum that has'been chosen for them. The 
implications of this situation fpK learning are quite important 
and are discussed next. 

Academic Learning Time and Achievement 

A major finding of the Beginning Teacher Evaluation 
Sf Jdy is that increases in Academic LearningTime are associated 
with increases in student achievement. The practical importance 
of Academic Learning Time in relationship to achievement is 
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illustrated in Tabic 4, using examples from the analysis of grade 
two reading instruction (Fisher, Berliner, Filby, Marliave, 
Cahen, Dishaw, & Moore, 1978a). This table displays total 
feeing scores in October, student engaged time with a high 
success rate (Academic Learning Time) in reading over the 
period from October to December, and estimated total reading 
scores in December. The reading scores in December arc 
estimated from linear regressions that use the test scores in 
October and the intervening Academic Learning Time to 
estimate the expected December test scores. Total reading scores 
in October and December are shown both in terms of raw scores 
out of 100 items (percent of items correct) and in terms of 
percentile rank among students in this study. Student engaged 
time with a high success rate (Academic Learning Time) in 
reading is "shown both in terms of total time over the five week 
inter-test period and in terms of the corresponding average daily 
time (in minutes). 

Reading across the rows in this table from left to right, one 
can see that given a particular reading score in October (two left 
columns) and a particular amount of Academic Learning Time 
(middle two columns), what the estimated reading score in 
December is (two right columns). It should be recogi\ized, 
however, that students who started with a given score in October' 
and experienced a given amount of Academic Learning Time 
during the intervening period did not always attain the same 
score in December. Therefore, it was necessary, for the puposcs 
of this V^^p' to riv.. at some estimate of a "typical" or expected 
score in December, give n certain initial scores and some amount 
of Acadehlic Learning T nc. In Table 4, the estimated December 
raw scores have a standard error of 1 1 .4. This indicates that two- 
thirds of the pupils with an estimated score of, say, 40 will have 
scores between 28.6 and 5L4. 

Table 4 shows that substantial increases in Academic 
Learning Time are associated ^vith important increases in 
achievement! Consider the student who started the period with a 
grade two reading score that was average (50th percentile, see the 
top three rows). If this student experiences the average amount of 
Academic Learning Time (573 minutes total, or 23 minutes per 
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Table 4 

Academic Learning Time iind Student Achievement: Examples from 
Grade Two Reading Based on the Beginning Teacher Evaluation Study. 
Phase III-B Results 



Reading Score at 
First Testing 
^(October) 


Student Engaged Time 
in Reading with High 
Success Rate 


Estimated Rcadmg * 
Seorc, Second Testing 
(December) 


Raw • 
Score 
(out of 

100) Percentile 


Total T,imc 
Over 
5 Weeks 
(Minutes) 


Average 
Daily 
Time 

(Minutes) 


Raw 
Score 
(out of 

100) Percentile 


36 50 
36 50 
36 50 


100 
573 
1 300 


4 
23 
52 


37 39 
43 50 ' 
' 52 66 


16 17 

16 n 

16 1"' 


100 ' 
573 
1300 


4 
23 

■ 52 ^ 


20 15 
25 21 
35 36 



Notes: . - 

1. An.:ivcr:igc of 25 school days occurred between the lirst and the second testing, 

2. The B reading scores arc estimated via linear regression. 

3. The values of all variables in this tabic arc within the ranges actually obtained in the 

sample. . 

4. I he average engaged lime with high success fate in grade two reading for the mter-test 

period was 573 minutes. 

I 

day in reading), then the student can be expected to show average 
reading achievement in December {50th percentile again). Note 
also that 'the "average" student with "average" Academic 
Learning Time does show considerable learning in terms of 
predicted raw scores. 

Table 4 also indicates that if this average student (in terms 
of October's test score) experienced only four minutes per day of 
Academic Learning Time (100 minutes total for the inter-test 
period), then she or he would be expected to show almost no 
•change in raw scores (36 out of 100 correct in October, 37 out of 
100 in December) and would decline considerably in relative 
terms {50th percentile in October, 39th percentile in December). 
However, if the same student experienced very large amounts of 
Academic Learning Time, 52 minutes per day in this example, 
then. he or ^he could be expected to answer almost 50 percent 
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more items correctly on the December test than on the October 
test (36 out of 100 correct in October, 52 out of 100 correct in 
December). Furthermore, in this situation the student would 
show considerable improvement in reading achievement relative 
to the otHer students in the study (50th percentile in October, 66th 
percentile in December). Thus, the student with large amounts of 
Academic Learning Time benefits substantially. 

!t may appear that this range from 4 to 52 minutes per day 
is unrealistically large. However, these values actually occurred 
in the classes in the study. Furthermore, one can easily imagine 
how either 4 or 52 minutes per day of Academic LearningTirne 
might comeabout. If 50 minutes of reading instruction per day is 
allocated to a student who pays attention only about a third of 
the time and only one-fourth of the student's reading time is at a 
high level of success, then the student will experience only about 4 
minutes of engaged reading at a high success level. Similarly, if 
100 minutes per day are allocated to reading for a student who 
pays attention 85 percent of the time, at a high level of success for 
almost two-thirds of that time, then she or he will experience 
about 52 minutes of Academic Learning Time per day. 

In summary, large differences in Academic Learning 
Time are associated with very important changes in predicted 
achievement levels. These ^ large differences in Academic 
Learning Time are well within the„ range that was actually 
observed for the Beginning Teacher Evaluation Study. In 
addition, one can easily , imagine how these differences m 
Academic Learning Time could occur in realistic situations. 
Therefore. Academic Learning Time is shown to be of 
considerable practical importance in terms of its relationship to 
achievement. 

Academic Learning Time and Attitude 

The data from the Beginning Teacher Evaluation Study 
revealed that students with high and low rates of allocated and 
engaged time were equally likely to have positive or negative 
attitudes toward the subject matter and the school; Educators are 
naturally concerned about whether greater than average time in 
academic pursuits or greater than average rates of attending will 



Learning Time 



220 



215 



result in negative attitudes. In the Beginning Teacher Evaluation 
Study, that did not happen. There was one consistent, positive 
trend in the data. It appears that students experiencing high rates 
of success are somewhat more- likely to have positive attitude 
toward reading and school. 

Opportunity to Learn, Content Coverage, 
and Curriculum-Test Congruence 

In the recently completed Instructional Dimensions Study 
(Cooley & Leinhardt, 1 978), a very well-done search for effective 
classroom processes, the variable "opportunity to learn" was 
extensively examined. Opportunity to learn was a composite 
variable. A classroom would score high on the variable if it had: 
lower enrollment, higher rates of attendance, higher allocated 
times in reading and mathematics, fewer transfers in or out, and 
higher rates of on-task behavior. Thus, in this study, the 
opportunity variable overlapped with the allocated and engaged 
time measures mentioned above. 

The variable of opportunity to learn also was defined in 
terms of curriculum overlap— an estimate of the overlap between 
what was taught and what was in the end-of-the-year 
achievement test. This is the issue of the degree of congruence 
between the curriculum .that is taught and the achievement test 
used to measure mastery of the curriculum. Another and related 
aspect of opportunity to learn, not studied directly in Cooley and 
Leinhardt, is content coverage. However, others have addressed 
the content coverage issue directly (Borg, 1978; McDonald & 
Elias, 1976). 

The results of these studies are now quite clear and 
consistent (cf. Rosenshine & Berliner, 1978). Opportunity to 
learn, content coverage, and curriculum-test congruence are 
important variables, discriminating between more and less 
effective teachers. 

Cooley an,d Leinhardt say it this way: "In summary, the 
major generalization with respect to classroom processes must be 
that the most'useful construct in explaining achievement gain is 
the opportunity that the children had to learn the skills assessed 
in the achievement test" (p. 32). 
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Opportunity is high when allocated time in some content 
areas-is high, engaged time in that content area is high, content 
coverage in that curricular area is broad, and that the time and 
content choices match the depth and breadth of the achievement 
'tests used in assessing the instruction. 

Direct Instruction 

All these trends in the data can be brought together under 
the still nebulous but semantically rich concept called "direct 
instruction" (Berliner & Rosenshine, 1977; Rosenshine & 
Berliner, 1978).* Direct instruction includes the opportunity 
variable and also refers to a syndrome of classroom variables that 
have empirical underpinnings. For example, classrooms with an 
academic orientation, rather than an affective orientation, 
achieve higher (h^ner et al., 1978b); also, classrooms in which 
the time spent was academically focused are found to be 
consistently higher in achievement. When large amounts of time 
were spent in story telling, art, music, or play, as opposed'to 
reading and mathematics activities, negative correlations with 
achievement were fourtd (Stallings & Kaskowitz, 1974). Related 
to these findings is the well-established consistent finding that 
those classes and schools generally called "open" or "humanistic" 
do not do as well on academic outcomes as those , classe's and 
schools generally regarded as "traditional." Moreover, even 
attitudes about self, school, and subject matter are not higher in 
the more open educational programs (Gage, 1978). 

Academic feedback has been found to be positively 
associated with student learning. Academic feedback is defined 
as information given to the^student about whether his answers 
were right or wrong. Many different specific behaviors were 
conceptualized as fulfilling this function, including answering 
questions in class, checking papers, programmed text, and oral 
reading. The percentage of instructional time during which the 
student received feedback was positively related to student 
engagement rate and to achievement. Hence, more academic 

•Nol to be confused with the Dircxt Instructional Program of Becker and Engelmann 
^ (1978). 



Learning Time 



poo 217 



feedback leads to higher engagement and achievement. (Fisher 
et al., 1978b). 

Structuring of the lesson and giving directions on task 
procedures have been found to be positively associated with 
student Success rate. Teachers who gave directions more often 
and spent time discussing the structure of the lesson had students 
who showed a, higher success rate. Students sometimes do not 
know what they are supposed to be doing or how they are 
supposed to mark a particular worksheet. Clarifying activities by 
the teacher helps raise student achievement, probably by 
affecting the ^'success rate'' component of Academic Learning 
Time (Fisher et al., 1978b; Tikunoff, Berliner, & Rist, 1975). 

The teacher behavior of monitoring was found to be 
important. When academic monitoring acts were high, the 
teacher was able to keep children engaged in their assigned tasks. 
This correlated positively with achievement (Fisher et al., 1978b). 

The classroom environment was also found to be 
important. Classroom environments, characterized as cooper- 
ative on academic tasks, warm, democratic, convivial, and with 
high levels of student responsibility for academic work, showed 
up as positive predictors of achievement (Fisher et al., 1978b; 
Tikunoff et al., 1975). 

The conclusions reached from these studies and the 
attempt to define direct instruction leads to a simple one-sentence 
statement that best summarizes what we now know: If the tests 
they use are matched to ihe curriculum they teach,' then 
elementary school teachers who find ways to put students into 
contact with the academic curriculum, and keep them in contact 
with that curriculunu\hile maintaining^ a convivial classroom 
atmosphere, are successful in promoting reading (and. mathe- 
matics) achievement. \ 

The ''learning student" in such classrooms can then be 
described. First, she or he works on an academic task that is 
designed to xesult in increased knowledge or skills. We have 
noted that the amount of time that the student spends in a given 
knowledge or skill area is directly and positively related, to 
learning in the same area. Furthermore, this appears to he as true 
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for the more conceptual knowledge areas as it is for the more 
basic skill areas." For example, comprehension skills are as highly 
related to time spent in working on comprehension as are 
decoding skills to time in decoding. Therefore, the learning 
student spends relatively great amounts of time working on tasks 
that are directly related to the subject matter to be learnedand the 
test used to measure that learning. 

The learning student is also very attentive. He or she is 
actively involved in the task at hand, probably with some 
enthusiasm. The learning student is busy performing the 
academic part of the task, .rather than sharpening pencils, 
looking for a book, or waiting in line to ask the teacher a 
question. She or he is not "socializing" or daydreaming. 
NevertheteSs, the student is enjoying the activity. Paying 
attenti^ for relatively long periods of time does not upset the 
student. Furthermore, his or her success on the task makes this 
active involvement more enjoyable. 

The learning student spends a lot of time practicing and 
reviewing skills. She or he undertakes an activity related to a new 
skill only after "thoroughly learning skills prerequisite to the new 
skill, so that she or he virtually never encounters an activity that is 

. really entirely "new." At a younger age and/ or an earlier stage of 
education, the learning student spends relatively greater periods 
of time practicing and reviewing. As she or he advances 
academically, it becomes possible to reduce the proportion of 
time spent practicing and reviewing. There is always some need 
for consolidation of acquired skills (practice), but as the student 
advances she or he actually "learns how to Jearn," so that it 
becomes easier to acquire newer skills without as long a period 
for consolidation of prerequisite skills. In addition, attempts to 
learn the newer skills probably serve to consolidate the 

• previously acquired skills, insofar as the more advanced student 
is applying these acquired skills to the newer skills. 

,.:f The need to spend relatively great amounts of time 
practicing and reviewing skills is probably partially a function of 
the fact that students are not constantly tutored. That is; students 
spend the majority of their time working independently or with 
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only intermittent teacher contact. Students, particularly, less- 
advanced students, have difficulty acquiring nev^ skills on their 
own, so they need to practice previously acquired skills when 
working alone. Therefore, as students become more advanced, 
they should 'not only be able to spend less time practicing and 
reviewing, but should also be able to spend more time working 
independently. 

The "learning student" is not necessarily an unhappy 
' student. The learning student does not learn to dislike learning. 
Lots of hard work may sound undesirable to most people. 
However, we do not find any evidence that students are less 
satisfied when the sheer quantity of work (allocated time) is 
relatively great. Furthermore, we do not find that students who 
pay more attention (work intensively) acquire a distaste for 
learning, fn fact, one coufd assume that when attention 
is the result of interest and enthusiasm, rather than coercion, then 
attention represents a more positive attitude toward learning. 
Student engagement does not appear, generally, to be produced 
by the coercive demands of the teacher. 

It is interesting to note that the high-success component of 
learning is associated with mpre positive student attitudes. 
Successful students probably enjoy learning more because of 
their success. Failure, even when it is only occasional, appear* to 
result in a more negative attitude among younger students. THis 
may be less true, however, as students acquire more academic 
experience and become more accustomed to school. 

Discussing what the learning student looks like is done to 
help teachers think about what they are trying to accomplish in 
the classroom. To some extent, the characteristics of the learning 
student are under the direct control of the teacher. Teachers 
make decisions about what to teach and how much time tp 
spend on a particular goal. Teachers should be aware of how. 
much time is really being spent on different skill areas. Classroom 
time is limited, so teachers should be careful to spend fime on 
those activities that they consider the most important. If some 
skills are particularly important for students, it would be 
reasonable to spend large amounts of time on those skills. Thus, 
the depth of coverage is important. But teachers need also to 
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examine the breadth of their curriculum. A wide range of content 
usually needs to be addressed if success on achievement tests is to 
be noted. 

Student success rate is also largely under the direct control 
of the teacher. As teachers assign tasks to students, they should 
try to match the task to the student's skill level so as to provide 
frequent instructional sequences leading to high success. This 
strategy is particularly promising at earlier grades and for less 
advanced students. Note that there have been previous advocates 
of this approach (programmed learning and mastery learning). 
However, many teachers probably do not recognize the extent to 
which less advanced students need practice and review. The other 
side of the success role is also important. Teachers should always 
be careful to avoid giving a student a task which is extremely 
difficult. • • 

Recommendations for Improving Schooling 

1. Ways are needed to monitor and then to increase 
allocated time if it is low. The monitoring can be done by 
teachers, with or without colleagues. It is very revealing and- 
sometimes very startling to teachers. The monitoring of allocated 
time leads to a confrontation with the single most important issue 
facing teachers, school districts, and state agencies: What is to be 
taught? More of one thing means, in a finite system, less of 
another. Those objectives of education receiving high priority by 
society ought to be the objectives receiving emphasis in the 
curriculum within the classroom. These objectives should be 
allocated more time, while other less important objectives 
should receive less allocated time. This very obvious recom- 
mendation is not new. But it is not now being implemented. The 
lack of implementation is not always conscious. Many teachers 
are never given feedback about these aspects of their teaching. 
They can, therefore, end. up building some perfectly constructed 
teaching-learning units that would have a very low time priority 
in the community they serve. In what ways can the classroom 
walls be scaled and feedback given to teachers about the use of 
time? What district resources can be mustered to provide this 
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kind of monitoring? What professional rights of teachers must be 
taken into account in order to do such monitoring? 

2. An intensive examination of the school curriculum and 
the content of examinations is needed. The content of instruction 
and the content of the tests used to assess instruction should 
reflect those things valued by a community (see above). 
Moreover, the tests and the instructional activities should be 
congruent. It is not likely that a second grade class with no school - 
based training in syllabication will succeed on the half dozen 
items measuring knowledge of syllabication that are given in the 
end-of-year achievement test. Who addresses these issues in each 
district? Who monitors whether congruence exists between what 
is taught and what is tested^ 

3. Success rate in th^ chosen curriculum should be high, 
particularly for younger and less academically oriented students. 
With many classes averaging 30 students, and aides often not 
permitted to work in instructional roles, the ability of a teacher to 
accurately assess the success rate for a particular child with a 
particular set of materials is limited. Classroom arrangements 
such as smaller class size, more instructional aides, more time 
allocated per day to assess class work, quicker ways to provide 
help when students are working independently, etc., all could 
help keep success rates high. 

4. Academic Learning Time— engaged time with aca- 
demic materials or activities, that are matched to some test and 
provide a high success rate— is a proxy for end-of-unit or end-of- 
year achievement. Until now, few classroom observers kn?w 
what to look for when visiting classrooms to monitor instruction. 
Thus, such silly notes about the grooming of the instructor and 
the neatness of the bulletin board would become part of an 
observational record. But alt, with known relations to 
achievement, can be monitored at any time; alt may be 
considered learning, as it occurs. This means that the cnd-of-year 
achievement test need no longer be the sole criteria of successful 
teaching. Successful teaching can now be defined, in part, 
although ALT is a process variable. Jhis is a very important and 
new conception of teaching and learning. 
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5. The academic press and other environment variables in 
a class are important. No one denies that schooling is to socialize 
students, to exploi'e feelings, to learn the arts, etc. But when 76 
minutes per day are spent in transitions from activity to activity 
(real data), or 85 percent of the elementary school day is spent in 
other than reading and mathematics activities (real data), young 
students cannot take very seriously the importance of academic 
achievement. These variables can be monitored and simple 
suggestions can be provided for improvement. 

6. Teaching functions, rather than teaching methods or 
skills, are important. "No one technique of instruction is clearly 
associated with disastrous outcomes or successful ones" (Cooley 
& Leinhardt, 1978, p. 40). McDonald and Elias (1976), in related 
research, found that there were patterns of instruction that were 
effective, but these differed from teacher to teacher, from grade 
to grade, and from context to context. Certain teaching 
functions, however, always need to be met for successful 
classroom experiences to occur. Academic monitoring must be 
accomplished in some form, though many different teaching 
behaviors can fulfill the function. Diagnostic and prescriptive 
functions must be carried out, though these can be accomplished 
in many different ways and can be done well or poorly, in 
individualized or non-individualized settings. Nonetheless, some 
diagnosis and prescription must take place. Feedback must be 
given to learners. Dozens of ways to provide such feedback are 
known. What is important is that feedback in some form takes 
place. have become too concerned about the relative 
effectiveness of open classrooms, personalized instruction, 
deductive and inductive methods, high and low cognitive levels of 
questions, etc., and have lost sight of the factthat certain teaching 
functions must be met regardless of the method used. There 
probably is no one best way to teach anything to all students, but 
there probably are similarities in the teaching functions met in all 
successful ways of teaching. 

7. Because of the complexities of today's elementary 
school classroom a conception of the teacher as executive is 
important in training teachers. Today's teacher is not trained to 
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manage four resource centers, traveling students, the scheduling 
of speech pathologists, special educators, aides, paraprofes- 
sionals, computer terminal time, and also engage in record 
keeping for rnainstreamed students and the preparation of 
individualized programs for all students. To find time for 
planning and carrying out direct instructional activity requires 
executive skill. But training for executive behavior is noticeably 
lacking in both preservice and inservice teacher education 
programs. 

8. The use of most games, movies, slide-tape materials, 
television shows, etc., regardless of their purported educational 
value, must be treated with suspicion. Heavy investment in 
technological aids of any kind is suspect because the match of 
what those materials teach with both the tests used to assess 
instruction and the accepted curriculum is usually quite poor. 
Most students' involvement in games or with media can be 
defended, but twt by relying on arguments about i\\t\v^ direct 
instructional effects. In fact, empirical evidence on the relation 
between time spent in such activities and achievement is negative. 

Conclusions 

Simple conclusions about what is successful in classroom 
teaching and learning have been provided. After many millions 
of dollars spent on research, the research community has 
validated some commonsense notions about education. But 
comnnm sense is not common practice, as a visit to the schools 
will reveal. The ways to change practice to maximize opportunity 
to learn are many and varied. Successful teaching and learning 
can occur within almost all pHilosophical positions and 
educational programs. Direct instruction, with high rates of alt, 
can be obtained without turning schools into factories and 
without using authoritarian or coercive measures. A student's 
ALT falls off when classrooms are too casual and non-directive, 
and ALT also falls off when the schools are too coercive and 
authoritarian. A middle ground is needed. A convivial, 
democratic, warm classroom with a teacher concerned about 
direct instruction seems to succeed in promoting achievement of 
the kind measured by most standardized achievement tests. 
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The Role of Reading in Bilingual Contexts 



Warwick B. Elley 

University of the .South Pacific 

In spite of the rapid upsurge of interest in bilingual education in 
the past decade, Ave are forced to agree with Laosa (1978) that 
^^comparatively little research has been conducted on bilingualism 
and bilingual education,". and "the research literature that does 
exist is fraught with contradictory results." Under such circum- 
stances, instructional methods are adopted and abandoned 
arbitrarily. Extreme and contrary views are widely expressed on 
how and when we should teach children second languages, and 
there is uncertainty amongst researchers about which variables 
we should be studying. 

It is recorded that 16 percent of American children now 
have English as thek second language. In many parts of the world 
the figure is considerably higher. In the South Pacific, where the 
writer is currently working, more than 95 percent of the school 
children learn English as a second or foreign language. For these 
Children, the uncertainty which reigns about acceptable theories- 
and practices in learning English in schools is critical. For 
English is the language of the high school and the university and 
for most it is the path to secure employment. Without fluency in 
English, the South Pacific child stands little chance of succeeding 
either in his school exaniination or in a white-collar job. But 
without fluency in his own language, he loses contact with his 
family and friends,^and the traditions that enhance his self- 
respect. The position is similar in many parts of the American 
Territories of the Pacific, in the Philippines, in Africa, in South 
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-East Asia, and in many more countries where the language of the 
home diverges from that of the school. How can children best 
acquire the language of two cultures? If we are to help the cause of 
millions of children in bilingual settings, there is an urgent need 

.for clarification of the issues involved in learning a second 
language., " . ' 

In'this article, it is planned to examine some of these issues 

.concerning the best w^y to teach English as a- second or foreign 
language, to describe some data v/hich attempts to throw light on 
these problems, and to propose a new and increased role for 

. reading in improving instruction in English as a second language. 
For it is argued here that one major reason for the slow progress 
of most children in bilingual educatipn is the deliberate neglect of ' 
reading in this process. The current ferment in psycholjnguistics 
and in research into the process 9! reading comprehension has 
produced new and important insights into the nature of language 
acquisition. JJnfortunately, little of this knowledge has rubbed 
off in bilingual education. With so many futures at stake, we 
cannot afford to leave these gaps unclosed. 

77?^ Bilingual Context /. 

In this article, bilingual education refers to education-- 
which is provided through" the medium of two languages. This 
includes all cases where children coine to school speaking ii ^ 
vernacular and switch at^some stage of their schooling to a second ' 
(or thirdy ianguage as the medium of instruction, while retaining 
and/or-developing their competence in the first. In some circum- 
stances the'transition takes place immediately upon entrance to 
school, as in the French immersion prdferam's pioneered, by 
Lambert ^1972). However, the typical pattern in the South 

• Pacific is for children to arrive at school with a good oral 
command.of their mothertongue— be it Fijian, Hindi, Samogn, 
Tongan, .or one of the many vernaculars in current use— and to 
have most of their instruction in the first three or four years in 
that vernacular.. The switch is gradual, with no more than 30 
minutes a day of English. as a second language in the first year, 

, and a marked increase in English in grades three or four. At this 
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stage most instruction is in English in all subjects,>part from a 
lesson a day in the vernacular, up to the middle of high school In 
countries which have only-one-vernacular, the switch takes place 
later; in countries with two or three widely used vernaculars the 
transition is largely completed by class four; in countries which 
have a multitude of home languages, the switch often occurs as 
soon as the child begins school. In most cases the teachers are 
bilingual, and moving back and forth from one language to the 
other in one lesson is very common. The basis for these various 
policies is found in a mixture of tradition, convenience, and 
parental pressures. Rarely can the administrator point to 
experimental evidence, or even an assessment of needs. 

Approaches to Language Learning in Bilingual Contexts 
Any reader professing an interest in methodology of 
second language instruction could hardly fail to recognize 
dramatic changes in the methods and rationales of such 
instruction in recent years. As Strevens (1977) points out, 
^^Unlike the scientific disciplines of linguistics and psychology, 
with which it has been linked in the past two decades, language 
teaching has remained an art and craft." And like all arts, it is 
subject to rapid changes of fashion. 

The prevailing fashion in foreign language teaching early 
in this century was formal and classical, with a heavy emphasis on 
grammar and translation. During the 1920s the direct method 
was introduced in some countries. Stress was placed on hearing 
the target language spoken, much as the child learns his first 
language naturally,' and the learner's native tongue was not to be 
used ..The idea of.strict controls of vocabulary and structures was 
popularized in the 1930s, so that the learner's input at the point of 
instruction >y^s to be carefully matched to his knowledge and 
stage of readiness. Audiolingual methods were widely used in the 
1940s, when crash courses were required for armed service 
personnel, particulafly.in Britain and the United States. These 
methods were further developed in the 1950s, drawing on 
theoretical rationales from "the work of such linguists as 
Bloomfield and Fries, and of behaviorist psychologists. Lan- 
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guage was seen as a set of habits to be learned and drilled like any 
motor skills, under conditions of repetition and reinforcement. 
The rules and structures of the target language were to be taught 
one at a time, orally at first, later through practice in reading and 
writing. In the developing countries of Africa, Asia, and the 
South Pacific where English was taught as a second or foreign 
language to young children, alongside or after the vernacular, it 
has been the audiolingual method which commanded the most 
adherents. 

More recently the work of Chomsky and the ''nativists'' of 
linguistics has given rise to a new approach to the learning of 
languages, both first anid second, Chomsky's attack on the- 
behaviorist view that languages are learned b^ imitation, 
reinforcement, generalization, and other such concepts has been 
widely publicized. Recent empirical study has provided much 
evidence of the young child's remarkable ability to produce 
unheard language structures and to generate novel ones- 
abilities which seemed to suggest a child who is predisposed to 
acquire certain patterns in language learning, to hypothesize and 
to correct his own hypotheses, and thus to gradually approximate 
the language of the adult. The key concept here is the child's 
intuitive creation of his own language code, not the conscious, 
systematic block-building analogy of the structuralist-behaviorist 
approach. It is as if the child is born with a predisposition to 
acquire certain language forms. All the teacher needs to do is to 
expose the child to a variety of structures and vocabulary in 
meaningful communication, and the child will develop his own 
language rules, easily and intuitively. 

Many more methods and theories have been promulgated 
in the search for the. best or optimum method of teaching a 
second language. Kennedy (1973) lists 13 different methods 
which have been seriously used and advocated, and suggests that 
all have been partially successful. However, his point should be 
reiterated that nearfy half the world's children may be fluent in 
two or more languages without formal instruction; yet few taught 
formally, in school, have ever been very fluent with any of the 
methods he enumerates. Clearly we have much to learn about 
hovy best to teach another language. 
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Why Is Second Language Learning Ineffective? 

Why do second language learners find their task so 
difficult? The reasons advanced for Kennedy's arresting conclu- 
sion have been many and various. Some point to the difference in 
motivation. The child who has mastered one language, has 
already learned how to satisfy his natural urge to communicate 
with those around him. Learning the second language for a 
distant examination, or vocational purpose, or to satisfy a 
bureaucratic or academic requirement, is a less compelling 
- reason than the ever-present desire to communicate with friends 
and family. As N4acNamara (1971) has pointed out, the teacher 
rarely has anything important to say to the child, and, the child 
has nothing important to say to the teacher. 

Others point to the interference effects of the first 
language on the second. Contrastive analysis has indeed 
identified a number of deviations in the expressions of second 
language learners— in phonology and syntax— which are readily 
traceable to the effects of the first. 

However, more recent studies (Dulay & Burt 1973) 
suggest that these interference effects, at least in the structures of 
the second, language, are limited to less than 10 percent of 
observed deviations, while others argue that these effects are 
limited to the first stages of learning, and appear subsequently 
only when the speaker is tired or under stress. These hypotheses 
need further investigation in a variety of language situations. 

Another plausible reason for the difficulties of the second 
language learner is the limited opportunity to hear and practice 
the iknguage. Even in the "immersion" programs where pupils 
are required to use the target language at all times, they spend 
only 25 hours per week in the second language environment. And 
in manv parts of the South Pacific the figure is considerably less 
than this. By contrast, the first language learner is exposed for 
most of his waking hours— over 100 hours per week. 

Many theorists regard imitation or modelling as a critical 
feature in learning a second or foreign language. When the 
teaching model is sound, where the structures are substantially 
correct, the vocabulary judiciously chosen, the pronunciation, 
accurate, the message clear and important -under these circum- 
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stances the child has a head start over one exposed to poor 
teaching models. Regrettably, bilingual teachers are not often 
equally fluent in their exposition of both languages. Examples 
abound of clumsy expressions, poor spelling and faulty diction, 
Townsend and Homer (1978) quote an example from an English 
lesson in a Pacific elementary school, where the teacher was 
struggling to teach her children that "1 f we doesn't be careful with 
these matches, it will burn ourselves," Where children and 
parents recognize their teachers' inadequacies, as they frequently 
do, progress is inevitably slow. 

Another hypothesis which may explain the difficulties 
experienced by the young second language learner is the 
artificiality of the language he is exposed to. Rarely do the drills 
and patterns practiced in early oral and written language sessions 
approximate the natural language that the first language learner 
is exposed to. As Kennedy (1973) puts it: "From the incredible 
structural richness of a language, we, ' the teachers, select 
phonological, syntactic, lexical and thematic items; we decide 
and arrange the sequence of their presentation to the student; we 
force him to practice the rules we think are being learned" (p, 75), 

Certainly, common sense- suggests that we make some 
allowance for simplifying language for. young learners,- If 
however, the child can learn a first language without systematic 
screening out of new words and structures, it behobves us to keep 
an opea mind on the question of how far this screeningshould be 
taken with, second language learners. If children do learn a 
language by deducing their own rules about its structure and 
occasions for use, the systematic screening .process may actually 
slow down the learning. When the child never has the chance to 
hypothesize, il) check, and to confirm or correct, his pace of 
learning will \^nevitably be impeded. This question of the 
optimum ration^of known to unknown words is still an open one. 

In sum, the second language learner, for a variety of 
reasons, has an uphill battle ahead of him, and instructional 
approaches have, to date, rarely succeeded in producing fluent 
confident users of the second or foreign language. Clearly we 
need some fresh approaches to the problem. If present day 
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methods are Uninspiring, it is time to examine their assumptions 
and see whether they are at fauh. 

The Audio lingual Method of Instruction 

Writing in 1975 Diller pointed out that 15 years earlier, in 
a "state of the art" address to the Ninth International Congress 
on Linguistics, William Moulton had been able to summarize 
two decades of increasing consensus amongst his peers, that 
language was best taught through an audiolingual approach, 
which consisted of "mimicry, memorization and pattern drill." 
Language was not thought of as rule-governed, but as a set of 
speech habits learned by conditioning and drill. 

It is true that the audiolingual approach was the widely 
accepted method of tesl in the early 60s. Structuralism in 
linguistics was at its height; and few linguists had any substantial 
grounds for challenging the psychological tradition that 
languages arc skills, learned chiefly by the mimic-model, with 
abundant practice and drills. 

Diller went on to assert, however; that the linguists of the 
middle 1970s saw no such consensus in the methodology of 
language teaching. Adherents of the audiolingual method are still 
alive and well, but their case is now widely challenged. Let us 
examine a typical audiolingual program and its assumptions. 

In the Soutfi Pacific, the audiolingual tradition has long 
dominated the English language learning scheme. During the 
early 1960s, Gloria Tate developed a program for use with Cook 
Islands Maori children learning English in elementary schools, 
using structurialist principles derived mainly from those of 
Charles Fries, and a behaviorist orientation in learning. This 
program has subsequently been extended for use in all countries 
* of the South Pacific, as well as many of the American territories. 
It is now produced and distributed by the South Pacific 
Commission (spc) and is used in virtually all primary schools in 
the region! Its principles are taught in all teacher training colleges 
and promoted by the advisers associated with all education 
departments. / 
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The rationale of the SPC/Tate Program has been set out in 
a number of handbooks (Tate 1967, 1971). Briefly, it assumes 
that language is a set of habits learned by imitating good models 
through regular drillf. Each new structure and word is to be 
consciously identified and its use practiced in oral form, 
following the teacher's model. Thus the scheme is predominantly 
an oral English program. Subsequently, the same structures and 
words are practiced in reading through the specially written spc 
Junior Reader Series and several other sets of graded readers. 
Errors are" to be avoided at all costs, as they are difficult to 
eradicate. Therefore particular care is taken to ensure that all 
English language the children hear or see is strictly controlled, so 
• that errors do not occur. One way to ensure this avoidance of 
error is to so control the reading books associated with the oral 
program that children are never exposed in print to structures 
and vocabulary that they have not previously learned in their oral 
lessons. Indeed, the guiding principle is that there should be a gap 
of twelve months between the presentation of oral and written 
forms in the early stages, narrowing to a three month gap or less 
at the upper primary school level. 

These principles are not unusual in an audiolingual 
program. More specifically, the idea that mastery of the oral 
language should take place before expo.sure to written forms, in 
case of interference, is still widely recommended. In his 
authoritative summary of bilingual teaching in the' United 
Kingdom, Derrick Sharp (1973) maintains that second language 
learning should begin in the'infant school (age five) and should 
remain an entirely oral activity until the late junior stages (ages 
ten to eleven) or the early secondary years for the less able pupils. 
He refers also to the .progress made in recent years in the stricter 
control over vocabulary and the carefully graded readers pupils 
are exposed to. Similarly, an orthodox view in North America is 
summed up by Ching ( 1 976) when she says that "before bilingual 
children can learn to read English,' they must be able to 
understand and spealc it effectively," (p, 4) and "items should be 
presented in spoken form before they are presented in written 
form" (p. 33). It is important to examine this assumption. 
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Why Delay Reading? ^ , ^ 

The reasons for the audio-lingual claim that reading 
should be so long delayed, and thus relegated to a minor role in 
language development, require further investigation. They 
clearly have inhibiting effects on teachers and learners. I n the face 
of this belief, pupils are not encouraged to read widely. Teachers 
are not active in building up libraries. Parents can have little role 
^to play in leading children to books. Is this restriction on reading 
really warranted? 

Wheeler (1974) suggests that the delay is based on a false 
analogy with first language learners. Monoglot children 
normally learn to read only when they have built up a large oral 
language ^repertoire in their preschool years. The fund of 
knowledge about words and structures certainly does provide a 
useful source of semantic cues for speeding up the reading 
process when other cues are weak. But the learner can also resort 
to cues from pictures, from text redundancy, from analogy with 
the vernacular, or from the teacher's translation. To wait until the 
children, have acquired a large repertoire means a long wait, 
duriqg which they are excessively dependent on oral methods. 
'Many children are "visualizers," who preferto see what they are 
learning in print. Certainly there are empirical data which show 
high correlation between oral language fluency.and early reading 
achievement, but we cannot argue from this fact that the oral 
strength is the cause of the facility in reading/ In fact, after a 
longitudinal study of the progress of Samoan and Maori children 
in learning to read,. Clay (1970) suggested that early reading 
progress depended more, on "progress in visual perception of 
print" than on sophisticated command of oral English. 

Another allegation made against the early introduction of 
the printed word to the young bilingual learner is that there is a 
danger of interference. The child who sees a printed form which 
does not fit phonetically with hiis pronunciation of it is likely to 
revert to a speech pattern which fits his vernacular preconception 
of how it is pronounced. Evidence for this view is not presented 
by its adherents, however, and it is not likely to occur in cases 
where they are considerably different. Even if it were a danger, a 
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case could still be made for the simultaneous introduction of both 
oral and written forms, rather than a lengthy delay. Of course, the. 
argument could also be turned around. Children are just as likely. . 
to invent incorrect spellings for the oral forms they have learned 
if they are not given the correct visual form of the word. And this 
will interfere with later written composition work. But it is time 
to look at, the empirical findings. / 

Research on the Use of Prim in Learning Language 

It has long been recommended by educational psychol- 
' ogists that a multisensory approach to learning new material will 
provide better results than a single-sensory niethod. When new 
spelling words are taught systematically, it.is common procedure 
to have children see the new word, say it, write it on paper, write it 
in the air, close their eyes and visualize it, and thus produce a 
greater impact on the pupil's brain than a single oral or visual 
presentation might. Does this multisensory principle apply to 
learning a new language? According to research on bilingual 
education in Wales, it certainly does. 

Dodson (1967) compared four methods of learning 
sentences in a second language on the part of pupils of both 
elementary (ages eight to nine) and secondary school (ages thirteen 
to fourteen). The four variables he studied were the presentation 
of the spoken form of the target sentences, the nrilien form, an 
•accompanying and the r}70ther tongue equivalent. Each 
child was interviewed individually, andj the criteria tested were 
fluency in speech and comprehension of meaning. For the 
younger children the most effective combination, by far, was that 
in which the picture, the spoken word:and the printed word were 
presented to the child. Least efficientiwas the picture plus spoken 
word The presentation of the mother tongue equivalent assisted 
learning for most children, but not as much as the presentation of 
the printed sentences. The trends were similar for the secondary 
school pupils, but the value of vernacular equivalents appeared 
to be nearly as great as that of the printed word. On a time-to- 
learn criterion, the best approach was that which employed all 
four variables simultaneously. Clearly the multi-sensory ap- 
proach finds support here. 
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In a series of follow-up studies in classroom contexts 
investigating the role of the printed word in learning German 
sentences, Dodson found similar results. When the printed words 
were exposed while the children listened to and repeated the 
teacher's oral presentation, the teaching time required was 
greatly reduced. Further confirmation was found in the learning 
of number concepts and in the recal: of sentences learned earlier 
In each case the printed word was a clearly effective aid to 
learning. 

As for the problem of interference in pronunciation due 
to the presentation of the words in print, Dodson states that it 
hardly ever arose. "Only a handful of letter combinations caused 
temporary interference lasting no longer than a few minutes, 
before a correct pronunciation was achieved" (1967, p. 20). On 
the other hand, the absence of the printed word was a problem for 
many who were unable "to sort out the tangle where one word 
ended and the next began." In Dodson's view, the presence of the 
printed word enables them to match particular sounds with 
initial letters of the words— or syllables in polysyllabic words— 
and so provide signposts to help them through the language. It 
also lejd to fewer problems of spelling later, and provided more 
time to practice good pronunciation. 

As Dodson pointed out, the use of print in initial language 
learning was unpopular with theorists at the time of his research. 
Whatever arguments prompted this Viewpoint seem to be^^ 
consistently demolished by Dodson's experiments. Yet, over a 
decide later the assumption is still widely held, and its 
implications extensively preached to teachers and ,would-be 
teachers. - 

Can Second Language Learners Teach Themselves? 

If children do not suffer when exposed to print while 
learning oral language, the question must now^ be put as to 
whether they might actually benefit by learning new language 
from the printed word, independently without the teacher's 
guidance: Is there learning potential in unfamiliar words? To 
what extent can pupils cope with new structures in- print? Can 



Reading in Bilingual Contexts 



212 



237 



they make some sense out of unknown language and go on to 
teach themselves new forms? 

The audio-lingual approach is again adamant on this 
point, Reading is to follow, not lead. Tate (1971) points out that 
"children should understand the meaning of whole printed 
sentences and of whole printed paragraphs immediately when 
they read them." Pitman (1974) in his rationale for the SK /Tate 
Program asserts that: "language cannot be taught through a 
reading program." 

These views stand in marked contrast to those expressed 
by recent psycholinguistic theories of reading in a first language. 
The audio-lingual proponent says we should avoid error at all 
costs. Yet first language learners who are prepared to guess, to 
hypothesize and confirm are now considered to be the best ^ 
language learners. Goodman sees reading as a psycholinguistic 
guessing, game. "Reading becomes a sample, predict, test, 
confirm and correct-when-necessary approach" (Goodman 1976, 
p. 238). Frank Smith confirms this viewpoint. "The most 
preferred and efficient strategies for proficient readers when they 
come across a word that is unfamiliar are to skip or to predict 
from context or by analogy with other words. By conducting 
experiments as we read, not only do we learn to recognize new 
words, we learn everything else to do with reading" (Smith 1978, 
p. 97). "Children do more than learn to read through reading; 
they learn language" (p. 70). 

To what extent are these principles'applicable to second 
language learners? Can a child without an extensive fund of 
words and structures be expected to detect redundancy in 
English, to learn from context? If we can venture to expose him to 
the unfamiliar, without harm, then many practical implications 
will follow. 

Firstly, more interesting reading material can be put into 
children's hands, materiafs that are not so strictly graded that high 
interest (but unfamiliar) words are screened out. Reading could 
become a more attractive activity. In consequence,., pupils- 
motivation to attribute meaning to new words will increase. They 
will want to know in order to satisfy natural curiosity, not to 
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satisfy the teacher. On the other hand, those who are perpetually 
protected from the unfamiliar printed word will not develop a set 
'to predict c>f hypothesize. Faced with a new word or structure 
they will stop and asl: for help. Those who never practice 
indepevyticnce will not achieve it. Furthermore, if children who 
read v-idely learn more about their language, then the sooner they 
'lire encouraped to forge ahead on their own the better. To forSow 
•tlje audio-lingual restrictions is to discourage extensive reading 
for interest, to slow down growth towards independence in 
learning and to retard language development. 

What evidence is there on the ability of second language 
learners to cope with new language structures in print? On the 
basis of observation of children in Hawaii and California and 
with the help of videotape playbacks, Joan Rubin (1975) has 
isolated some of the strategies characteristic of good second 
language Jearners. At the top of her^list is the guessing strategy. 
"The good guesser uses his feel for grammatical structures, clues 
from the lexical items he recognizes, clues from the redundancy 
in the mesage. He uses nonverbal cues, word association clues, 
outside knowledge (his general knowledge of society, of 
similarities to his native language). He makes inferences as to the 
purpose, intent, point of view of a message." Rubin sees no 
difference between first and second language; learners in these 
respects. Nor does Twaddell (1973) or Vivian Cook ( 1969). And 
guessing is possible for children as well as adults. Yet guessing is 
strongly discouraged by the audio-lingual approach. Can we find 
some evidence on this point? Perhaps it does not apply with 
young children. Perhaps there are qualifications in situations 
where English is. not used putside the classroom. What kind of 
structures can-children cope with? Can they use context to guess 
the meaning of unfamiliar forms? These questions are important 
in portraying a role for reading in bilingual contexts. 

Experiment on Learning from Context in Print 

In an attempt to probe some of these issues, the writer 
conducted an investigation'with bilingual children in grade five in 
two South Pacific countries. In one country, Fiji, English is 
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learned predominantly as a second language, after initial reading 
instruction in the vernacular. In the other country, Niue, English 
has the role of a foreign language, as it is spoken only rarely 
outside the school. In both countries oral English commences in 
grade one; English, reading, in grade three. 

Twenty-four structures taught orally in the Tate Oral 
Program in grade six were identified and placed in the context of 
meaningful sentences or short paragraphs. These structures had 
not been taught to the pupils in the investigation. There is a 
rigidity about the sequence of the Tate Syllabus which ensures 
uniformity in this respect. The question to be examined was how 
well the pupils would comprehend each of the structures in a 
meaningful reading context, heforexhty had learned them orally. 

The sample of pupils studied consisted of 100 typical 
grade five children (eleven years-old) from three schools. One was 
a below-average urban school of Fijian and Indian children from 
a predominantly working-class suburb of Suvar another was a 
middle-class school in an innercity suburb, with a predojninance 
of Indian pupils, again in Suva. The third class was drawn from 
Niue, in the sm^ill town of Alofi. All pupils had learned to read 
first "in their home language, and were in their first year of 
learningto read in English. All were using the Tate Oral Syllabus 
and its associated structured readers. 

The unfamiliar structures to be read were presented in 
>;hort simple sentences or paragraphs, and a parallel set of 
sentences containing familiar structures was prepared as a 
control device. Each child read twelve of the familiar and twelve 
of the unfamiliar structures. The study was conjiuctcd in intact 
classroom groups. After reading each sentence the children 
responded to open-ended questions, assessing their compre- 
hension of the meaning of the .structure in question. 

Examples ^ - 

(I) (a) Unfamiliar structure: The children next door were 
frightened by the little dogs, 
(b) Familiar structure: The little dogs frightened the 
children. 
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QUESTIONS (i) Who was afraid? 



(ii).Who frightened them? 



(2) (a) Unfamiliar structure: Mother said, "There are hardly any 
bananas left, Rima. Go and buy some more. There's a 
shop not far from here'' 
(b) Familiar structure. Mother said, "We have only a few 
bananas left, Rima. Go and buy some more. I saw a shop 
near here.'' 

QUESTIONS (i) Did Mother have any bananas at all? 

(ii) Was there a shop near them? 

In each class, pupils were assigned at random to one of two 
groups, X and Y.. Pupils in Group X would read sentence 1(a); 
pupils in Group Y would read sentence 1(b). Both groups re- 
sponded to the same questions. Then pupils in Group X would 
read Sentence 2(b); those in Group Y would read Sentence 2(a); 
and so on.' 

' Table 1 sets out the results f(j>r 18 of the comprehension 
questions asked, expressed in percentage form. For these^lS 
familiar structures, the pupils had little difficulty. All percentages 
were well above 60 percent and very similar^to the figures for the 
comparable familiar structures. The mean score on the unfamil- 
iar structures was 78.33 percent, just 4.61 percent less than the 
mean score for the familiar structures. Apparently these pupils 
can comprehend structures which they have not been specifi- 
cally taugfit. Apparently they can and do learn from context. Yet 
the Tate Program prevents^hese children from either seeing or 
using such structrures. 

Table 2 presents the figures for the remaining 6 structures 
which were not well understood. All showed percentage correct 
. scores less than 60 percent, and the mean score was 25 percent or 
53 percent less than that obtained from the familiar structures. 
Closer inspection suggested that these were inherently difficuh 
language forms even for more mature children. Perhaps they had 
no place in the grade six program at all. To investigate this issue, 
the same sentences and test questions were presented to a class of 
35 grade seven pupils in Suva from the same suburb as the first 
group of class five children. These children had been taught the 
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Table 1 

Grade Six Language Structures Well Understood 



1. In order to , , . 

2. When Joe had done... 

3. . . . were frightened by . . . 

4. ...can't be true. 

5. . . . went to his mother carrying . . . 

6. ... on which the paper was put . . . 

7. . . . without making a sound 

8. It's such a big cat that... 

9. . ! . too strong for me to . . . 

10. Xan it be a... 

11. It must be a... 

12. It doesnH take him long to . . . 

13. He is also big. ' 

14. It must take him a long- . . 

15. There's a shop not far from . . . 

16. That-s not enough for. . . 

17. . . . cents for you to take. 

18. ...so that he wouldn't be 

. Mean: 





Comparable 


1 1 n milt!) r 


Familiar 




Structures 


68% 


76% 


86% 


96% 
84% - 


89% 
68% 


. 82% 


76% 


' mo 


65% 


90% 


79% 


92% 


92% 


84% 


96% 


95% 


87% 


66% 


76% 


■ 71% 


63% 


76% 


76% 


9'4% 


71% 


84% 


79% V 


. 82% 


76% 


76% 


87% 


87% ' 


76% 


76% 


78% 


83% 



particular structures tested in the grade six year. Therefore the 
hypothesis was that their ability to comprehend these structures 
in writing would be significantly better than that of class five 
children. Table 2 shows that this hypothesis was oot, in fact, 
supported. Grade seven children scored 28 percent correct on 
these 6 structures, only 5 percent more than children in grade five 
and still well below the figures for the comparable familiarstruc- 
tures. Apparently, the effects of specific instruction in the struc- 
tures in question have little or no effect on the ability of the pupils 
to comprehend them in print, r. . - 

These findings seem to be consistent with the results of 
recent studies by Dulay and Burt ( 1 973) with bilingual children in 
the United States. These investigators used comparative error 
' analysis to sho\v that Spanish-speaking elementary school pupils 
make the same kinds of errors in learning English as a second 
language, regardless of whether they were systematically taught 
the relevant structures or not. Their results were interpreted to 
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. 53% 


(96%r 


65% 


(100%) 


34% 


(78%) 


^^48% 


• (84%) 


13% 


(39%) 


13% 


(68%) 


13% 


(97%) 


5% 


'.(91%) 


8% • 


(66%) 


-J 8% 


'(83%) 


,16%. 


(82%) 


18% 


• (87%) 


'23% 


(76%) 


28% 


(86%) 



. . Table 2 
Grade Six Language Structures Not Well Understood ^ 

^ . Gr 5 (Control) Gr 7 (Control) 

J. Neither Peni nor Subhas saw the cat, 

2. Anjula hasn't been at school since. 

3. We have hardly any bananas left . . . 

4. A II the children but Sashi have : . . ^ 

5. "It's too far for him to . . . 
6- A mile further than . . 

Mean: 

*/ - • . ' 

mean that it is a waste of^tme to teach children syntax dehber- 
ately, a conclusion which^s clearly at variance with, the spirit of 
audio-lingual methods. 

Such findings raise serious difficulties far any program 
which is based on the assumptiofi that children should be taught 
^language structures in a particular sequence, and that -reading 
materials^should be graded insuchawaythat children are not ex- 
posed to the unlainiliar On the contrary, bilingual learners in 
typical schoqls can and d<>\learn from written context. They can 
make sensible and correct predictions about unfamiliar struc- 
tures in meaningful settings, and instruction in such structures is 
•apparently not helpful for them. The results -in the four South 
Pacific schools in the study were virtually identical. The same 1 8 
structures were well understood, regardless of whether the 
children were learning in a second or a foriegn language situa- 
tion. 

^ One implication of these studies is that children should be 
encouraged to read more extensively. If bilingual pupils can 
teach themselves the meaning of new language when they see it in 
print, then they should not be restricted to the few highly struc- 
tured readers that make up the reading fare of the typical child 
learning in an audio-lingual program. In the^euth P^ific this 
conclusion implies a complete, change ot policy. For most 
primary schools Have few or no library books; no indigenous 
literature exists for South Pacific children; and most pupils never 
read inde'pendently for pleasure. In fact, many teachers take so 
seriously the'restriction that children should not be exposed to 
unfamiliar words or structures that they never read stories to 
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children, in case they may hear something new which would 
cause confusion and error. Such inhibiting policies may well be 
the root cause of the now widely documented weaknesses in 
children's learning of English language in the South Pacific (see 
Elley, 1979; Elley & Mangubhai, 1979; Stamp, 1979; Townsend 
& Homer, 1978). Standards of English reading and writing are poor 
and apparently declining; high school students cannot under- 
stand their textbooks; university students are requesting assist- 
ance with the comprehension of lectures and texts. And where 
English is a problem, learning is slow in all subjects. 

Evaluation of a Reading- Based Program 

Not all South Pacific educators have been pleased with 
the audio-lingual approach to reading. The director of education 
in the small island of Niue, De'Ath ( 1 978), a former New Zealand 
adviser on reading, was not impressed with the progress of his 
primary school pupils in English language, or with the lack of 
emphasis on reading in his pupils' English syllabus. The spc/Tate 
Junior Readers, which are used in Niue and in most South Pacific 
schools, are deliberately graded and articulated with the oral 
English syllabus in such a way that pupils see and practice only 
. familiar words and structures. The choice of language is dictated 
by the need for repetition of logically sequenced structures; 
interest meaning, and natural language patterns are frequently 
sacrificed for the demands, of form ^nd structure. Typical 
quotations from the Junior Series Book 3 demonstrate the kinds 
of problems faced by the authors in making the, materials 
appealing for children. 

1. Tom has a tin and Peter has a tin, too. 

Tom is holding his tin and Peter is holding his tin. too. 

2. Here arc Peter and Mary. 
Are they going home? 
No. they aren't. 

They're going to Tom's house. 
/\re they good children? 
Yes. thev are. 
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Is Mary's dress clean? 

Yes. it is. ; 

Are licr hands clean? 

No. they aren't. . - 

It is difficult to imagine South Pacific children reading 
such material for sheer enjoympnt. Faced with a situation in Niue 
where English language was jased very little outside the class-, 
room, and the teacher's models of English inside the classroom 
were often at fault, De'Ath ( 1 978) argued that the children would 
learn English more readily if they were provided with a richer 
language? environment in print. While accepting the oral 
component of the Tate Syllabus, he questioned the limited role 
given the printed word for teaching new langauge, and set out to 
develop a new set of readers, known subsequently as the**Fiafia" 
Reading Books. ^Tiafia" in the Niuean language means^happy." 

The distinctive qualities of this new program are as 
follows: 

1. The materials are all high interest stories, based on 
local people and familiar situations. (The school bus 
breaks down; the farmers' pigs escape and get into^ 
mischief.) 

2. Early language was controlled to a certain extent with 
regular repetition, but unusual or ''interest words" 
and new structures were frequently introduced to 
enhance the story. 

3. Each book is well illustrated, usually with a touch of 
humor. 

4. Most of the books are short enough to be read in one 
sitting. 

5. Large supplementary readers are used in a "Shared 
Book Experience" method. 

In this method, the teacher has the children sit around her in a 
group, on a mat. She presents an interestingstory, "blown-up" in 
big print. First, she reads the story to the children; then they read 
.it together. They stop regularly and talk about the pictures— just 
like Mother reading bedtime story. Then the children read the 
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story again. The teacher calls on individual pupils to read 
specific words or phrases, or to answer questions about the 
meaning of the story. A high premium is put on group discussion, 
which arises riaturally if there is a good story. Thus the children's 
own language experience can be complemented and extended. 
The children enjoy reading the stories together, the unison chorus 
approach encourages natural intonation patterns, and there is a 
valuable opportunity to learn basic concepts about books and 
book language, and to acquire new words in their context by 
listening to the teacher or to other children. They learn easily, 
without the embarrassment that comes when the teacher 
identifies and exposes their ignorance. This "Shared Book 
Experience," sometimes called the "Cooperative Book Method," 
is a compromise between reading to the pupils and reading by the 
pupils. Rather, the teacher reads with the pupils— a method very 
popular in New Zealand schools at many levels. 

So, the main differences between the new Flafia Program 
and the traditional spc reading program are that the Fiafia 
Program depends on a good story, provoking group discussion 
about it, and the learning of new words and structures in. the 
context of that story. New'language is learned only in context— 
and a written context at that. 

The SPC readers, on the other hand, are designed to help 
pupils practice particular words and structures. The stories are 
carefully graded to keep out anything not already learned in the 
separate oral lessons; consequently, they lack excitement. The 
children work gradually and systematically through the books, 
two pages a day, much as they might work through a 
mathematics book. 

The situation in Niue .provided an ideal context for 
examining the role of reading in the program of English^as a 
foreign language. It was .possible to compare a complete age 
group of children in grade three (eight years) who had taken the 
Tate Oral English Syllabus in association with the Tate Junior 
Reading Series in 1977 with another grade three group who had 
taken the same oral syllabus in association with the* Fiafia 
Reading Program in 1978. All other instructional and home 
background factors were held as constant as possible. The 



children in both cohorts were the same age; they had been 
learmng oral EngUsh for two years; they had learned to read in 
their vernacular; the hours of formal instruction in English and 
the vernlacular were the same; the teachers involved in the two 
grade tjir^e programs were the same; and checks were made on 
reading perfx)rmance of pupils at the next grade level to ensure 
that there was no systematic growth in English due to unknown 
community or^school influences. 

The experimental context was ideal for evaluating the 
effectiveness of these two approaches to English teaching— the 
audio-lingual Tatk Program, with its minimal role- for reading, 
and the Fiafia Program, with its major role for reading. 
For in the virtual absence of English outside the school, the 
activities of the classroom would show their effects on children's 
language growth in far bolder relief than would be the case where 
pupils learned both inside and outside the classroom. Instruction 
in a foreign language provides a better laboratory context to 
examine the influence of curriculum variables which are so often 
neutralized by the effects of home and community. 

Design of the EvaiuatiGn * 

The design of the evaluation called for the testing oi all 
Niue pupils early in their grade four year. The Tate Program' 
pupils were assessed in March 1978; the Fiafia Program pupils in 
March J 979. Ali grade four pupils on the island were tested in 
both yearns but the children of one of the six schools were omitted 
from the analysis, as that school had been the scene of all the 
trials of the new Program in 1977. Consequently; its pupils had 
not taken the Tate Program in that year. 

Three' tests were devised to assess the pupils' performance 
in English-^/?e/3J/>7g Comprehension, Word Recognition and 
Oral Sentefice Repi fition, The Residing Comprehension test was 
adapted from, a mu-itiple-choice sentence completion test 
designed for Cook Islands Maori children in an earlier project. It 
was trialed also in Fiji a nd Niue (on pupils in the trial scliool) and 
revised/ to form a 35-item, group-administered test with simple 
vocabulary and a local South Pacific iiavor. The choice of 
vocabulary was influenced by a Niue word frequency list 
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prepared by the author in an earlier project. It was not influenced 
by the contents of either the Tate or Fiafia Readers. All pupils 
were given this test in both years. It was administered in all 
schools, by a Niuean education adviser, using both English and 
Niuean instructions and in the presence of\the writer. 

The Word Recognition test consisted V 50 words, graded 
for difficulty, which children were required to read aloud to the 
writer (or his Niuean counterpart). Half of the words were drawn 
from the Tate Readers and half from the Fiafia Readers. The first 
words were simple high frequency words found in both programs 
(hop, look, me, not, and). The last words were more difficult but 
still included in the Tate Syllabus at higher levels (weigh, 
machine, breathe, lightning). Half of the pupils, chosen at 
random, were administered this test in each year. The correlation 
between the results of the Reading Comprehension Test and the 
Word Recognition Test was 0.75. 

The Oral Sentence Repetition Test was introduced 
primarily to test for incidental effects of the two reading 
programs. The test was adapted from one developed and 
validated by Clay, Gill, Glyn, McNaughton, and Solomon ( 1976) 
and consisted of 28 English sentences of increasing grammatical 
complexity. Each sentence was read aloud to a pupil by the 
writer, or his Niuean counterpart, and the pupil was required to 
repeat'it verbatim. The rationale for such tests is that if pupils do 
not have a particular structure in their repertoire, they will state it 
incorrectly and/ or omit the unfamiliar element. Recent research 
shows that it is a sensitive measure of language competence, 
rather than a test of rote memory (Hanayan, Markman, Pelletier, 
& Tucker, 1 978). Each child, in both years, who had not taken the 
Word Recognition Test was given the Sentence Repetition Test 
by the writer or his Niuean counterpart. Three examples were 
given for practice. When a child scored less than 3 sentences 
correct in one of the 4 sections of the test, he was stopped. 

Typical sentences were: 
My brother's knees are dirty. (N be +) 
,the cat is drinking some milk, (nvn) 
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Here comes a great big truck with children riding on the 
back. (Here vn) 

The reliability figures reported by Clay et al. (1976) are over 0.90 
and the correlation with the ReadingComprehensionTest inthis 
investigation was 0.54. 

I n addition to these three objective tests, u brief interview 
was conducted with each child in both years in order to gain 
further insight into his oral language fluency and attitudes 
towards reading. 

Despite two teaching staff changes, for part of 1978, and a 
three month, delay in the arrival of the supplementary readers for 
the Fiafia scheme, tl\e program was taught according to, plan in 
most cases. Some teachers made greater use of the "Shared Book 
Experience'' than did others, the timetable was standardized, 
and the number of supplementary books in the school libraries 
was very similar from year to year. 



Results of the Evaluation 

Table 3 shows the mean and standard deviation for each 
test in both years. In each case there was a dramatic increase in 
mean scores under the Fiafia Program, highly significant in all 
tests. Thus, pupils taught^with a relatively ungraded reading 
program^ which stressed interest and meaning rather than 
structure, produced better reading comprehension, better word 
rpcognition, and better mastery of oral language. Furthermore, 
the interviews showed that both pupils and teachers enjoyed the 
Fiafia Program more than the Tate Readers. Gains were shown 
in 16 of the 18 test comparisons made, and some schools showed 
increases, of more than 100 percent with the same teacher. 
Clearly, a reading-based program of this kind has much to offer. 
Clearly, too, the shaky assumptions which led to the audio- 
lingual neglect of reading are undermined even further. For after 
three years of English teaching under the sPc/Tate Program, the 
quantity of simple words recognized and understood by grade 
four children was little better than chance. 
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Table 3 

Language Test Scores for Tate and Fiafia Reading Program 

Tate Program 1978 Fiafia Progranj 1979 
Language Test Mean S.D. N Mean S.D. A/ Difference 

Reading Comprehension 10,54 5.50 89 14,02 6.63.62 p<.001 
Word Recognition 14.85 12.00 49 29.41 12.39 34 p<.OOI 

Oral Sentence Repetition 6.51 5.00 39 10.9t 6J0 31 p<.00t 

It is difficult to identify the particular elements which 
produced the marked improvements in the Fiafia Program. The 
12 booklets themselves and the 36 supplementary readers are not 
polished examples of the publishers' craft. In fact they need 
thorough editing and the illustrations and photographs are often; 
poorly produced. Nevertheless, the children were happy to read 
and reread them often, thus reducingthe need for repetition to be 

built -into the text. 

There is a strong suggestion in the school-by-school 
analysis that those teachers who made greatest use of the Shared 
Book Experience produced the most growth; those who used it 
less often, produced the least. Reading specialists who promote 
the Shared Book Method (Nalder, 1975; Porter, 1977) would no 
doubt agree that it is an important ingredient, as it has convinced 
large numbers of New Zealand teachers who have made use of it. 
Ritchie (1978) found it the most effective of three programs in her 
evaluation of a Maori preschool project in New Zealand. It is a 
method which capitalizes on children's natural interest in a good 
story. New language is taught at the point when children are 
highly motivated to read. Every psychologist and practical 
teacher knows the benefit of intrinsic motivation. 

If these explanations are plausible, the policy implications 
for second language learners are clear. If children can teach 
themselves new vocabulary and structures, in much the same way 
as first language learners, and they can capitalize on the Shared 
Book Experience in much the same way as first language 
learners, then it is clearly important to loosen up the restrictions 
we have imposed on the language environment of young children 
in a bilingual setting. 
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Conclusion} 

It hj^5 t^^come obvious to many teachers in bilingual 
programs ^^ere is very little transfer from patterned practice 
exercises to genuine or^i or written communication by students. 
Uninspiring^ r^tJetitive drills turn students off, leaving little 
impact on tH^'^ active language repertoire. As Ervin-Tripp 
(1970) points ^^t> a^y learning model which concentrates on 
language inpUt> without regard to selective processing by the 
learner, just A^'f not Work. Apparently, we cannot teach children 
language. ^an only "^ake it possible for them to teach 
themselves. 

Wh^t the formula for better quality bihngual 

programs? ^^^e seen that formal instructional programs have 
rarely been ^li^^^^ssful in teachingchildren a second language. Yet 
children learfJ ^^'^st and second languages quite efficiently outside 
the classroojti. ^hen communication needs are genuine. 

We }>ijv'e seen that second language learners profit best 
from a muiti^^^sory approach, rather than from one which 
neglects the pri^lted Word. We have seen, too, that typical grade 
five chiidr^ti learn new structures in the target language, 
without fortfial tuition, if given ^n opportunity to do so— in the 
form of a n^^a^^'ngful context. Furthermore, we have seen that a 
bilingual pr^^^'^m ^Vhich stresses learning from, interesting 
reading m^teJ^'^ls accompanied, by active questioning and 
discussion j^^ore effective than a formal, carefully graded 
reading pr^g^a^, tied to a tight sequence of orally taught 
structures. 

Tht^c f^^^dings are consistent with the contentions that 
good language learners approach their task best when they are 
highly motiv^te^i to discover meanings and patterns rather than 
required to p'^ctice structures. If "language learning is more 
effective wl^^n ^He learner is often faced with thfe unfamiliar and is 
keen to gu^^S hypothesize to fill the gaps, then audio-lingual 
programs ha^^ ^een guilty of retarding learning by discouraging 
guessing ai^^ P^^eventing children from attempting to work out 
new meanitx^s for themselves. We should encourage the teacher 
. to reward gii^^^ing, to recognize that readers^ deviations from 
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text are frequently signs of positive thought, harmless approxi- 
mations which indicate an important step towards independence 
in language learning. 

The audio-lingual approach has been guilty, too, for 
neglecting the printed word. 1 n the South Pacific, children rarely 
read for pleasure, and teachers rarely read to them. The few 
books available. do not lend themselves to enjoyable reading. 
This state of affairs is clearly unsatisfactory. 

There are probably no simple solutions to speed up 
learning in bilingual context. However, we could begin by 
attempting to exploit the potential of the printed word by 
building up a stock of reading njaterials geared to pupils' natural 
curiosity, their love of narrative, of excitement, of humor, their 
easy identification with mythical heroes and with children like 
themselves. We should.make such factors paramount, and worry 
less about strictly controlled language. In this vein, extensive 
reading, in the shape of deliberate "book floods" in programs for 
bilingual Polynesian children in Auckland have shown much 
promise (Elley, Wat.son, & Cowie, 1976). Furthermore, the 
evidence on the value of reading aloud to and with children is 
now considerable. It is widely used and recommended with 
Polynesian children in New Zealand. It receives further support 
from the Fiafia evaluation described above, and is now being 
tried out in several Pacific Island countries. 

1 n the past, abundant reading has been recommended as a 
means of acquiring information, as a way of gaining insight into 
human nature, as a method of enriching our leisure hours, and 
bringing us into contact with the minds of literary giants. The 
evidence is mounting that it is also a very effective means of 
learning language, of extending vocabulary, and of coming to 
grips with new syntax. There is reason to believe that bilingual 
children can benefit from reading in all these ways. It is surely no 
coincidence that the vast majority of competent bilinguals whom 
the writer has met in the South Pacific were voracious readers m 
their youth. They probably learned their English outside the 
classroom. 
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Let us not then cut our children off from a rich and valid 
source of learning by artificially pruning their language and 
presuming that we can control just how and in what order they 
will learn. Their natural language learning capacity is greater 
than we thought. We are just beginning to learn at what stages in 
their development, and with what kinds of materials and 
methods we can exploit it fully. 
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Characteristics of Exemplary Reading 
Programs 



S. Jay Samuels 
University of Minnesota 



Increasing Literacy Requirements 
and the Need for Quality Programs 

In a technologically advanced society one of the important 
functions of the school is to educate its students so that they will 
be literate. A historical look at the concept of literacy reveals 
rather dramatic increases in the amount of reading skill required 
in order to be considered literate (Resnick & Resnick, 1977). 
With the inflation in literacy skills brought on in part by the ever 
increasing demands of our technologically oriented society, the 
quality of the school reading program has taken on added 
importance. 

Because improving the quality of the school reading 
program is considered to be an important endeavor, educational 
researchers are currently attempting to specify the characteristics 
of successful reading programs with the underlying hope that by 
specifying the components which successful programs share in 
common, it will be possible to upgrade the. quality of the less 
successful reading programs. 

Purpose of This Article 

Researchers are not the only ones who want to upgrade 
the quality of reading programs. Others who share this goal are 
school administrators, faculty, and the public. It seems as though 
schools are under increasing pressuro to be accountable for the 
academic progress of their students. The increase in account- 
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ability is. rnad€ manifest, in several ways, through local and 
statewide evaluations of academic progress, reports- in local 
newspapers regarding the 'scholastic standing of each of the 
' schools in the district, competency, based promotion and 
graduation requirements, and law suits brought against school 
districts for what amounts to educational malpractice. 

This article has bee,n written as a response to the increased 
need to improve the quality of scjiool reading programs. In 
pursuit of this goal, this article will describe the characteristics of 
successful and unsuccessful reading programs, and will touch 
upon the difficulties which school districts have encountered in 
trying to maintain quality reading programs. 

The]Need for Comprehensive Reading Programs 
\,^-"^ In this discussion of program quality, it should be pointed 
out that the term program has a broader meaning than the term 
method. In reading, we may refer to a phonic method, a look-say 
method, or a linguistic method. As viewed here, the 'reading 
method use^d by a teach,er is merely a component within a larger 
entity called the reading program. Of course, the reading method 
one uses is important to -overall program quality, butf other 
components are importantjis well. For example, in addition to 
the reading method, other components which can affect program 
outcome include staff, administration, parents, and students. 
Thus, a program can be thought of as a set of interrelated 
components consisting of human, material, and procedural 
factors which influence the extent to which institutional 
objectives are realized. Armor, Osegura, Cox, King, McDonnell, 
Pascal, Pauly, and Zellman summarized their study of program 
characteristics and achievement by stating . .it is important to 
emphasize that no .single school or classroom factor taken by 
itself is likely to produce large . increases in reading achieve- 
ment Reading instruction is far too complex to- allow for 

"simple policies or 'quick fixes'." 

The plan which will "be followed in this article is to 
describe briefly each of the program characteristics for a number 
of exemplary reading programs. Followingthedescription of the 
programs, there will be a summary which will attempt to 



256 



Samu'els 



delineate the common characteristics across those exemplary 
programs described in this article. Finally, there will be a brief 
section describing characteristics of unsuccessful programs as 
well as the problems one encounters in trying; to maintam- 
'progi-am quality, over time. ' 

Components of Exemplary Reading Programs , 
Found in Six Reports 

Weber (1971)^. In this project, Weber looked tor ghetto 
schools where reading scores were at the medjan for the nation as 
a whole The four innercity schools in which Weber conducted 
his study were P.S. #11 and P.S. #129 in New York City; 
Woodland School in Kansa. City, Missouri; and Ann btreet 
School in Los' Angeles, California. Since the students in these 
ghetto schools were from low.socioeconomic level homes, one 
might have predicted that the level of academic achievement 
would be low. Instead, the level of achievement was approxi- 
mately-equivalent to the. reading achievement found in the 
average income schools of. the United States. 

■The four successful scjiools shared . certain program 
characteristics. Weber ideAtified these shared characteristics as: 

1 Strong leadership. Administrators were aware ot the • 
. problems and strongly supported efforts to raise 

achievement. ' 

2 High dxpectations. Adminstrators, teachers, students, 
and>arents were convinced that the students could 
succeed. Failure was' unacceptable.- , 

3 Both task and human relations orientation. The 
schools had sense of- order and disciphne.-The mood 
was described as pleasant and happy. Although the 
teachers placed great emphasis on academic achieve- 
ment, there was equal emphasis given to helping 

iThccr- , .. If success which Weber used was a national grade norm scoVeas a mediae 
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students feel good about themselves, their classrooms, 
and their racial origin, 

4. Teacher aides. Wherje leacher aides were used, they 
were directly involved in instruction. Use of teacher 
aides in direct instruction changed the student-teacher 
ratio. Thus, instead of one teacher to thirty students, 
there might be two, three, or four teachers working 
with the students. 

5. Additional reading personnel. The reading specialist 
worked directly with teachers and teacher aides.ao 
upgrade teacher skills and to provide teachers with 
needed materials. This is known as the multiplier effect. 
If the specialist works with ten children, only ten 
children are helped; but if the ten teachers are helped, 
who in turn work with 30 childi^en, then in effect, 300 
children are helped. 

6. Decoding emphasis. An early emphasis was given to 
what may be called a. code-breaking subskill approach 
to reading rather than a holistic meaning emphasis. 
This should'not be taken to mean that comprehension 
was unimportant, but that an early focus was given to 
those skills which facilitate word recognition. 

7. Evaluation and quality control. There was continuous 
evaluation of student progress, and evaluation data 
served as a basis for diagnosis of student difficulty and 
the remediation of those problems. 

Hawkridge, Tallmadge, and Larsen (1968)^. This survey 
is important because eighteen successful reading programs were 
matched against eighteen unsuccessful programs. The following 
differentiated the successful from the unsuccessful programs: 

-This study matched successful and unsuccessful programs on the basis of age of students, 
ethnic composition of students, number of students served, and types of learning 
evaluated. The successful programs were drawn from an earlier study (Hawkridge. 
Chalupsky. & R'obcrts, 1968) In which I.OOO compensatory education programs were 
surveyed and 21 of these were selected because they had cleai program descriptions, 
appropriate sample size, description of tests, evidence of reliability and statistical 
significance of the posttest. In addition, the program had to demonstrate "measured ^ 
benefits," which meant that scores on standardized tests had to improve more than they 
would have under regular school programs. 
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1. Explicit goals. The program goals were clearly stated. 

2. Focused\instruction. Classroom instruction was aimed 
at achieving the objectives. 

3. Relevant materials. The materials used were appro- 
priate for the methods used and the goals. 

4. Efficient use of time. There was high intensity of 
treatment. 

5. Diagnosis. The learner's progress was monitored and 
checked at regular intervals and appropriate instruc- 
tion given to each student to overcome individual 
problems. 

6. Teacher trainings Additional training was given to 
teachers before or during the program. ^ 

7. Parental involvement. Parents were utilized either as 
aides or to help students at home. 

Wargo. Tallmadge, Michaels, Upe, and Morris (1972)^. 
The purpose of this study was to obtain a detailed listing of 
exemplary reading programs, define the components of each of 
these programs, and then look for common elements across 
programs. The shared characteristics are listed below: 

1. Academic objectives clearly stated and/or carefully 
planned/ 

2. Teachei^ training in the methods of the program. 

3. Relevant instruction. 

4. High treatment intensity. 

5- Active parental involvement. 

6. F^irsonnel committed to the objectives and procedures. 

7. Close supervision of teachers and aides, 
s/utilization of additional personnel in the form of 
/ teacher aides and/or reading specialists. 

'9. Continuous assessment system providing feedback 
I and diagnostic information— immediate feedback of 
/ results, 

n'wenty-onc exemplary reading programs identified from earlier surveys were analyzed 
for characteristics shared in common. 
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10. Structured environment to aid student on task 
beiiavipr. 

11.1 ntensive stafnng--use of aides. 

New York State Office of Education (1974)\ The purpose 
of this study was to compare innercity elementary schools which 
were matched on social and cultural variables but which differed 
in reading achievement. In the successful schools, the admm- 
istrator had a significant impact on the school's effectiveness. The 
administrator helped to develop plans for the reading program, 
provided planning time for the program, and helped to 
implement the program plan. In the unsuccessful school, the 
administrator took little direct interest in the reading program. 

In the unsuccessful school, teachers tended to blame 
nonschool factors for the low achievement of the students, 
whereas, in thp successful school, the responsibility and locus of 
control were placed within the school and under the control of ' 
teachers. Teachers in the unsuccessful school were pessimistic 
about their impact on students.' 

The successful school climate was one of high expec- 
tations, blended within a warm, humanistic, and rewarding 
atmosphere; whereas, in the less si^ccessful school, expectations 
were lower and there was less of a human relations orientation. 

In summary, the successful school was marked by strong 
administrative leadership, positive expectations for success, 
extensive pupil evaluation, and special training of teachers. 

C RA FT Project Harris and Serwer (1966P. The purpose 
of this study was to compare two approaches to the teaching of 

^Two New York Qity elementary school.s were .«udied to determine what .specific school 
factor inOuenccd reading achievement. Schools were matched Icr soeioeconom.c status 
race?and culture; but one school had significantly higher achievement ,n grade.s two. four, 
"nd six on three measures: functional reading level, word recognition, and compre- 
honsion. 

n-his study concerned itself with gains in reading achievement and whether tho.se gain.s 
resulted from the of approach used in reading or the amo,;,,MV spent tcachin^^ 
reading. A total of 1.141 disadvantaged pupils were drawn randomly from 12 schools 
ocated in the black ghetto areas of New York City The teachers were volunteers within 
the system and were Trained extensively in the specific approaches to be utilized. Both the 
skills centered approach and the language experience approach were compared. Pre- and 
posae'^s were administered to students with the posttest scores adjusted on entering 
scores to account for differences in entering skills. 
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reading and to determine the effec; i.f time devoted to reading on 
the reading gams of students. After one year, the skills centered 

and the languageL.e>:psri.encr centered approaches, were compared, 

and a small but com istsnt difference lavorir;? tbe skills cen •.;rcd 
over the language experience method was found. This finding is 
consistent with the emphasis on phonics, decoding, and subskills 
emphasis found in other programs. 

Another finding .oLinte rest from the CRAFT Project dealt 
with the relationship between time allocated to reading and 
achievement. "Time, allocated to reading" has two components, 
actual reading time and some residual, which represents factors 
such as handing out books, changing groups, and giving 
instructions. As may be expected, this aspect of the analysis 
showed that actual reading time was significantly correlated with 
achievement, and that when large periods of time allocated to 
reading were spent on nonreading activities, such as handing out 
books and giving directions, the results were unfavorable to 
achievement. 

Direct Instruction Model (1977)(>. T,he purpose of the 
study, which was called "Project Follow Through," was to 
compare each of nine different approaches on the learnmg of 
basic academic skills. The approaches, or models, varied 
from global whole-person, individualized child-centered, open 
classroom, language experience approaches, on the one hand, to 
skills emphasis, behaviorally oriented models. What is mter- 
esting to note is that the students in the Direct Instruction Model, 
which focused upon mastery of component skills in reading, and 
which utilized principles derived from the psychology of learning 



'■Project Follow Through was viewed as a program which followed through w.th the 
educational efforts initiated with Head Start. The project was administered by the U S. 
Office of Education, and eventually came to serve 75.000 low-.ncome children annually 
under the guidance of what may be thought of as nine models of ed ucation Of the nme 
models the Direct Instruction Model was ratdd number one in reading. In the strictest 
sense, the project did not use an experimental design, but each model wascompared to its 
own control. The comparisons for each model and its control were made on tests of 
academic skills, cognitive-conceptual skills, and measures relating to self-image. The 
basic data from the above tests were collected by -Stanford Research Institute and 
analyzed by Abt Associates. 
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and instrucUonTfanlced-^M^ matter 
learning but in measures of self-esteem, higher.even~thanTfuclWts 
who were in programs where building self-esteem was considered 
an important goal. 

The' Direct Instruction Model has the following char- 
acteristics: 
Assumptions 

1. All children, regardless of socioeconomic status, are 
capable of mastering basic academic skills. 

2. Teaching failure is not excused. 

3. Children from -disadvantaged homes have less well 
developed prerequisite skills necessary for; academic 
achievement. 

4. Students from disadvantaged homes must be taught " 
more in less time in order to catch up with other 
students. 

The Rationale for Six Essential Teaching Components in the 
Direct Instruction Model 

1 . Teach the general case. By teaching a subset, the whole 
set is learned. For example, by teaching 40 sounds and 
skills for blending them together and saying them fast, 
generalized decoding skill, relevant to one-half of the 

V . common English words, is learned. 

2. Use teacher aides in direct instruction. This increases 
the number of instructors in a classroom. 

3. Daily program is carefully structured. Routines are 
established so that time is used efficiently. 

4. Maintain student attention. This is done through rapid 
paced, teacher-directed, smaM group instruction. 

5. Provide training and supervision. Teachers are care- 
fully trained arid supervised to ensure that appropriate 
skills are taught and utilized in the classrooni. 

6. Maintain quality control. Student progress is moni- 
tored' with bi-weekly, criterion-referenced tests which 
help to detect problems while there is time to correct 
them. 



262 



2'67 



Samuels 



The Rationale for D^ist'mctlve Features of the Direct Instruction 
Model 

1. Scripted presentation of the lessons. The purpose is to 
^provide^quaHty control in the instructional delivery 

system^ ~ — — — - — 

2. Small group instruction. It is more efficient tharTorie^ 
to-one instruction and allows for better supervision 
than large group instruction. 

3. Reinforcement. Although most people would agree 
that learning should be its own reward, there are many 
students who need extensive rewards to encourage 
learning. Hence, token economies may be used for 
students who do iiot respond to games, praise, or 
attention. : 

4. Training and supervision. Teachers are trained to use 
this approach to reading and are supervised to ensure 
implementation of instructional methods. 

5. Program design. Utilizes task analysis, specifying 
objectives, analysis of objectives into component 
subskills, identifying prerequisite skills, selecting ex- 
amples and sequencing the skills. 

Wilder s (1977P Educational Testing Service Study of 
Exemplary Compensatory Reading Programs, On the basis of 
achievement test scores, on-site visits and questionnaire data, five 
schools with exemplary cpmpensatory reading programs were 



' 7ln 1971 the United StatcsOfficeofEducation funded this study of compensatory reading 
programs in grad'es two. four, and six. ETs conducted this study of characteristics of 
examplary reading programs with the pfemise that "the only real difference between 
compensatory and non-compensatory reading instruction lay with the soUrce of the 
funding support, and consequently the study should be free of artificial restraints such as 
funding sources, materials, and characteristics of students. The definition of 
compensatory reading instruction which was' adopted was any readmg instruction 
provided for students because they were reading below grade level. 'Reading program' 
was broadly conceptualized to include the totality of a schdoPs reading instruction 
including demographic and sociological features.** \ 

The effectiveness of the reading program was determined by aschooPs posttestreading 
scores in relation to its pretest scores: Schools which performed better than expected were 
selected for further study. By means of analysis of achievement Scores and site visits an 
original sample of 741 schools was narrowed down to five exemplary schools which were 
examined in depth for their characteristics. 
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selected What follows is a description of their characteristics. 
In all of the successful schools: 

1. Reading was given top priority. This priority was made 
evident through time spent in reading activities and 
money spent on materials and resources, 

2. There was effective educational 'leadership, either 
through the office of the principal or a resource person, 

3. Attention to basic reading skills was the essence of the 

~" progr-am. 

4. A variety of 'niaterials~~^^s--used^JJL_^^ 
encountered difficulty with a skill, alternative meThods"^ 
and materials were available to teach the skill, 

5. There was discussion and cross-fertilization of ideas 
* about reading. This took place as part of the inservice 

programs which are part of the development and 
maintenance'of strong programs of reading. 

Summary of Components of Successful 
Heading Programs across Investigations 

Having described the components of the^successful 
reading programs:.for each of the investigations, it is possible now 
to examine the extent of overlap which these programs share. 
Table 1 provides a convenient means for visual inspection of the 
program characteristics found in each of these successful 
approaches to reading instruction. 

As mentioned previously, a good program is consid.erably 
more than a reading method. It may include factors and variables 
as subtle as assumptions about the abili':y of children to master 
basic reading skills, or the responsibility which theschool hasfor 
helping children master basic academic skills. In addition to.the 
assumptions which underlie the programs, other important 
factors include personnel, training, application of principles of 
learning, and evaluation. 

At the possible risk of ovefgeneralization, how might one 
summarize the con^ : Tuund in successful reading pro- 

grams? 
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Components 

Strong admii^i^tr^Mve leadership 

High expect^^i^^^^ ^nd belief 
that the teaci^^r make ^jie 
difference 

Teacher aids ^^^^ direct 
instruction 

Reading speQi^H^' 




Teacher role ^ Present, 
diagnose, ^et^^d*^^^^^ 

Specific readitig ^^jectiv^s 

Skills center^^ c^^Mculurn 

Instruction ^j^^j ^^terials 
relevant to g(>^|^ 

Structural ei^v/K^^^^nt 

Positive clas^j-^?^^"^ climate 

Efficient use ^f*^*^e 

High intensify ^reatm^nt 

Frequent ev^^jii&^^'^n of 
student progj.^5^ 

Supervision ^ft^^chers 

A + means that ,,£,iTiPVnt wus iocntf»<-'d 
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L Underlying Assumptions 

There are several important assumptions or beliefs held 
by staff in exemplary reading programs which seem to be 
associated with student academic growth. The first assumption 
one often finds is that the school can have a significant impact on 
the academic achievement of its students. Consequently, the 
school is responsible to a large extent for both the success and 
-JaiLure^joMts^st^ The second assumption is that most 

children are^a^ble'ofmastering-t-he basicaca^ If one^ 

assumes these two premises, namely that the school is resporisTble^ 
for student achievement and that most students can master basic 
academic skills, then one does not routinely absolve the school 
from the responsibility for student failure with explanations such 
as the student had "poor motivation/"^ "lack of readiness," or 
"inadequate home background conditions," Instead, when 
students fail to learn, the school assumes the major responsibility 
for identifying and remediating the problem, 

2. Personnel 

Administrators, One frequently finds a strong admin- 
istrative leader associated with the exemplary reading program. 
In fact, the Rand Corporation Report (Berman & McLaughlin, 
1978) on implementing and sustaining educational innovation 
states that the importance of administrators to the creation and 
•maintenance of innovative practice can hardly be overstated. 
Administrative support, in the form of encouragement and the 
creation of an organizational climate which gives the project 
"legitimacy" is essential to the project's development, operation, 
and maintenance. The support which the administrator gives 
is not in "how to do it," but in providing time for planning and 
carrying out decisions, securing necessary financial support, and 
running interference against any counterforces in the school or 
community. 

Teacher aides. In order to reduce the student-to-teacher 
ratio, teacher aides were often used iii direct instruction in the 
exemplary projects. The aides had received training, so that an 
observer in a classroom might find what appeared to be anywhere 
from two to four teachers in a classroom. In actuality, what one 
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was observing was :i teacher and several* trained teacher aides 
working directly with students. 

Reading specialists. A number of projects used specialists 
in a variety of ways, from helping to train staff, to providing 
materials, to senang as on-the-job consultants. The specialists 
did not work as remedial education teachers but worked directly 
with teachers and teacher aides. 

Teachers. The belief system of the teacher is important. 
One important element in the belief system is a feeling of 
commitment, dedication, and support for the project goals. The 
feeling of commitment and support becomes the energizing 
force which motivates the teacher to provide the time and energy 
necessary to implement the project goals. The teacher who 
believes that student success or failure depends upon what 
happens in the classroom and that student failure is not 
acceptable tends to be found in the exemplary reading programs. 
In theHand Report, Herman and McLaughlin (1978) state that 
teachers rise to challenges. Ambitious and demanding projects 
seem more likely to win the commitment of teachers than 
routine projects because the demanding project appeals to a 
teacher's sense of professionalism. A major reason for teachers 
taking on the extra work and disruption of attempting to bring 
about change tn themselves and their students is the belief that 
they will become better teachers and the students will benefit. 

3. Teacher Training and Supervision 

There seems to be strong support across investigations of 
exemplary reading programs for teacher training. Research by 
Berman and McLaughlin (1978) indicates that educators and 
goverment officials have been too optimistic about how much 
time is needed to produce change in an educational system. Their 
research indicates it takes two years to get a project off the 
ground, two years to implement a project, and an additional two 
years to produce a stable effect on student achievement scores. 
Thus, it might take six to seven years before one could expect 
stable achievement gains. 

The teacher training which takes place occurs during the 
two year start-up time and the two year implementation time. 
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The Rand Report spates that helpful training experiences were: 
Concrete, teacher-specific 

Regular meetings which focused on practical problems 

Teacher participation, in decisions 

Teacher observation of similar projects or model lessons 

Additional factors found in the successful innercity schools were: 
Classroom assistance from project staff or 'reading 
specialists i 

Local materials developed 
Principal participation in training 

In order to insure implementation of prescribed teacher 
practices, some of the projects supervised instruction. 

4, Curriculum 

Objectives, The successful reading projects had objectives 
which were clear and specific. Although the primary focus was 
what might be thought of as "task orientation," there was a 
"human relations orientation'' as well. The twin goals reflect the 
fact that the best environment in which to help master academic 
skills is one in which students find the classroom to be a friendly, 
warm, and supportive place to work. In other words, when there 
is task orientation combined with positive classroom climate, the 
situation is conducive to academic growth. 

-Skills-cenlered^curriculum This approach to reading 

utilized what may be callera:"siJbskill^oriem.atign^^ 
With this . approach fluent reading is thought of as a Highly 
complex behavior which can be analyzed and separated into its 
components through a process called task analysis. The 
components are then, taught to beginning readers in a sequence 
which is determined partly by the task analysis, partly by logical 
analysis, and partly from what we have learned through research 
on instruction. In order to help the student integrate the subskills 
into the higher-order behavior required in skilled reading, in 
addition to the instruction subskills, the student is given ample 
practice in reading meaningful and interesting material in 
context. 

. Samuels 



Relevant msiruction and materials. The successful 
reading programs had clear and specific objectives and the 
subskills which were taught were derived by means of task 
analysis. What one tends to find in the .exemplary programs is 
that both the type of instruction .and the materials were 
appropriate and relevant for achieving the specific objectives. 

5. Time 

" Time is an important factor to consider in learning and 
achievement. While it is essential to allocate a sufficient amount 
of time so that learning can take place, it is equally important that 
the available time is. used efficiently. In successful programs, 
teachers devoted more class time to task-related activities, and 
the major part of the school day was spent in structured 
activities that left little unoccupied time. In addition, instruction 
was kept at a low level of complexity. The classrooms were 
described as being orderly, since less time was wasted on 
discipline problems and giving instruction onmutine matters 
such as passing out books and transition from one activity to 
another. In summary, in several of the successful programs, extra 
amounts of time were allocated to reading; and in ail programs, 
time was used efficiently. 

6. Structure .... 

Stucture is an important vehicle Tor utilizing time 
efficiently. I-n a sense, structure is part ofgood management skills 
in which routines are established in order to facilitate the 
movement of students and materials. The successful programs 
■ helped teachers to establish good management skills so that time 
. would not be wasted. Th^s concept can be extended.to the point 
where the most efficient vi^ays to communicate ideas to students 
were developed and shared with teachers. 



7. Evaluation • , 

Student progress is monitored through frequent testing 
on each of the subskills so that quality control can be maintained 
through diagnosis and remediation of student problems. 
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Characteristics of Unsuccessful Reading Prqgrqfns 

While .it is useful to know the characteristics associated 
with successful reading programs, if we wish to build prograyis 
which will increase reading achievement, it is probably equally 
important to, know what is associated with those programs which 
were unsuccessful. With this end in mind, we shall summarize 
several reports which have described characteristics of programs 
which did not significantly improve achievement (Armor et aL, 
1976; Berman & McLaughlin, 1978). 

1. Improper program iniiiaiion and implemeniaiion. 
Before ./ program can work there must/E5e a climate of 
enthusiasm, commitment, and |iipport for it from the personnel. 
It is this .climate of commitment and support which motivates 
the staff to implement and translate the project goals into 
effective classroom practices. The following approaches to, 
ini:;atiori fail to create the necessary climate for support and 
implementation. 

Opportunism, The motivation for the initiation of the 
project is opportunistic, representing a resppns^tp political 
pressure or an attempt to get federal money.' Opportunistic 
initiation is characterized by lack of suppor^and commitment 
from both central office and project staff. Since there is lack of 
support "at any level, the staff is unwilling to invest the resources, 
time, and energy necessary to implement the project. 

Top-down motivation. Although thie central office staff is 
sincerely interested in the program, they fail to win the support of 
the project staff} Consequently, the project staff does not invest 
the time' and energy necessary to implement the project. 

Bottom-up motivation. The "grass roots'' enthusiasm of 
th- instructional staff is not^ matched by the central office staff. 
Consequently, there may be inadequate project implementation 
because the central office fails to allocate a sufficient amount of 
resources and, staff services. 

2. Inadequaie incubation and development time, A 
reading program is broad in sco|ieJn.corporating elements wjiich 
include personnel, methods, materials, finances, and clients. 
Changes which affect this many elements require considerable 
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periods of time for planning and implementation. C 
research suggests that significant innovation requires t' :.. 
of planning and incubation time, two years for implemt 
and two years to produce a stablfe effect on student achicv. 
Those projects which were unsuccessful rushed forwardtoo soon 
and ofien failed to cr^?,t" »he necessary climate for district 
support and compTiitnv ' . , \ 

3 Narrow,' p,'"- -approaches. There are so many 
important elements in • .... d reading program, that simple, 
"quick fix," single element approachlss usually cannot produce a- 
significant impact on achievement. To produce a significant 
imrfact, a comprehensive approach -operating on a student 

over time is required. 

4. Lack of commitment, feelings of inadequacy. The less 
successful projects tended to have staff which either did not have 
.a feeling of commitment and dedication to the project plan or 
Jacked a sense of efficacy regarding their ability to bring about 
the necessary changes. 

5. Unsupportive principal. The school principal is a vital 
eiemem, and in the unsuccessful projects, the principal often 
failed to create norms which emphasized hard work" and 

-dedication and an environnnent which supports teacher efforts 
and protects the teacher from disruptive forces. 
- 6. Inefficient use of time. Tinie is a critical variable in 

^learning. It is obvious that if mastery is to occur, the amount of 
time allocated must' be commensurate with the needs of the 
students. However it is important that the allocated time be used 
efficiently. In the unsuccessful classrooms, time was wasted 
because routines, were not , established and there were. often 
interruptions brought about by discipline problems. 

Problems in Maintaining Quality Programs 

The purpose of this article has been to delineate the 
components found in successful and unsuccessful reading 
programs in the hope that this infprmation would bepf help to 
schools in establishing more effective environments in which 
students would acquire literacy skills. Although the information 
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regarding components'-or successful programs is now available, 
this information is only a starting point. 

Schools which currently have successful programs are 
faced with the task of maintaining their quality programs, while 
the less successful schools have additional obstacles to overcome. 
They first must improve their current programs and, once they 
have improved their programs, they must then endeavor to 
maintain them at quality levels. The Rand Corporation Report 
(Bt/rman & McLaughlin, 1978) contains some sobering conclu- 
sions regarding the establishment and maintenance of quality 
program^s. First, they found that successful projects were not 
easily disseminated to new sites and the performance levels in the 
new sites often fell short of the performance level in the original 
site." Second, everi the original successful projecl> had difficulty 
sustaining their succfcss pver a number of years. Third, often 
when the original funding agency withdrew its financial support, 
the.school district was unprepared to provide the level of .support 
necessary to sustain the project. 

These sobering findings regarding the difficulty of 
fostering and maintaining change are neither surprising nor unique 
Ij xhe fiel.d of education. We still have much work to do. While we 
seem to ki:ow a considerable amount about the characteristics ol 
. successful reading programs, we still need to know more about 
which' of these characteristics • uniquely define the successful 
programs and are not present in the unsuccessful programs. 
Although the goals are difficult to achieve, the importance of the 
tai^k is such \h'xi it seenls to be well worth the effort. ^ 
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i 

Twenty-five years ago, the U.S. Supreme Court reversed its 1896 
decision and ruled that "separate but equal" schools for blacks 
and w+iites were inherently unequal in educational oi^portunity 
and, therefore, unconstitutional. This decision initiated desegre- 
gation of school districts and introduced a new and different as- 
sumption into the concept of equality of educational opportuni- 
ty, ''the assumption that equality of opportunity depends in some 

- fashion upon the effects of schooling"'(Coleman, 1%8, p. 15). I'n- 
der the 1896 decision, judgments on equality of educational op- 
portunity were based on equality t>f educational input, which 
was assessed by such criteria as cost of buildings and supplies, 
teachers' salaries, and academic preparation, class sizes, etc. 
However, education of exceptional children, even prior to 1954, 
required unequal input: class sizes were much smaller and frc- 

' quently required special facilities and equipment. In effect, the 
r954' Supreme Court decisiCn expanded the, principle under 
which exceptional children had been educated to now include all 
^jthnic groups. • 

Moreover, the responsibility for attainment of achieve- 
ment shifted from the student to the educational institution. 
After ' tracing the evolution of the concept of equality of 
educational opportunity,. Coleman ( 1 968, p. 22), summarized the 
shift of responsibility in these words: 
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rhcimplicalioii ot the most recent ecineepl,,.is that theresponsr iily to 
ereate aehie\einenl lies with the educational Institirtinn. m)t the child. 
The dilTercncc In achievement at grade 12 between the average N.egai . 
and the average white is. in elTcct. the degree ol ipcquality of oppiirtuni- 
ly. and the reduction of Uiat inequality is a responsibility of the school. 
This shift in responsiblirty follows logically from the change in the ci)n- 

. ' '^ccpt of equalitv of educational opportunity from school resource inputs 

to effects of schooling. When that change oceuri . d. as it has in the past 
few years, the schooPs responsibility shifted I"m ^ increasing and dis- 
tributing equally /rv "quality'* to increasing the quality oi' Ms Min/cnis^ 
■ achievements. This is a notable Miift. a- one which should have strong 
consequences for the practice of educauon in future vcars. 

Thus, the Supreme Court decision had two major educational 
consequences: 

1. Schools had to desegregate their students, whether or 
not they had been segregated a) cic jun\ for example, through 
gerrymandering of school boundary lines or b) cle fac ta, lor-ex- 
ample. through residential location. Consequently, desegrega- 
tion meant that .schools had to sec' inixtut cs of majority and mi- 
nority students an.J eventually teachers in each school in propor- 
tions that approximated their representation in the population. 

2. The concept of equality of educational opportunity had 
to be assessed on the effects of schooling. Of course, the concept 
of equal effects did not mean that all students were to be alike in 
achievement, but at least averages and ranges of ind ividual differ- 
ences in achievement among schools were to be approximately 
the .same. Con.sequently, when a complaint was filed in court that 
^a California school district was practicing f/f^ym-e' segregation be- 
cause the reading achievement scores of students in its schools 
with predominantly black students were lower than those of stu- 
dents in itN schools with a high percentage of v. hite students, it 
was upheld,hv the court. Therefore, this school district was or- 
dered to desegregate its students (Pcna vs. Superior Court of 
Kern County, 1975). 

But desegregation did not free a school district from the 
responsibility of attaining equality or output for its ethnic minor- 
ilies. Compensator, education was stiU nece.ssary because the 
magnitude of achievement among eihric groups throughout the 
c'ountry was unequal, as sh^nvn by ^'Hquality of Educational Op- 
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porlunity" (Coleman et al., 1966), a survey more popularly 
known as the "Coleman Report." 

Purpose of This Review 

\Vc shall review the Coleman Repoit and selected studies 
of compensatorv education that have been conducted over the 
-ast 25 years. These studies include 1) the effects of desegrega- 
tion upon the edircational achievement of majority and mmonty 
groups in one school district and 2) the result? of two major-ex- 
periments to improve the education of disadvantaged Jiildren: 
"Project Head Start" and its sequel "Project-rollbw Through." 
Our purpose in reviewing these studies is to determine what eiiect 
the new concept of equality of educational o[^t)ortunity has had 
upon the academic achievement of educationally disadvantaged 
children. We shall start our review with the Equality of Educa- 
tional Opportunity Survey. 

Eqiialiiv of Eiliuaiional Oppoi tiiniiy Survey 

The Equality of Educational Opportunity Survey was au- 
thorized bv the Civil Rights Act of 1964. Its purpose, as stated in 
.Section 402 (Colemai. et al.. 1966. p. iii) was to "conduct a sur- 
vey.. .concerning the lacl^'of availability of equal cduciitional op- 
po'tunitics for individuals by reason of nv.;e. color, religion, or 
m.uonal origin in public educational institutions at ail levels m 
<!), United States, its territories and possessions, and the District 
( o.umbia." The groups investigated were "Negroes. American 
Indians. Oriental Americans. Puerto Ricans living in the ccnti- 
nental r:..i:ed Sta'es. Mexican Americans, and whites other than 
Mexican Xmericans and Puerto Rici-ns often called 'majority" or 
simply -white"." The survey focused on four major questions. 
These are the Survey"s questions and its answers, almost verba- 
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1 What is the extent to which racial and ethnic groups are 
segregated from one another? The survey found that the "great 
majority of American children attend .schools that arc largely 
segregated."" Among minority groups. Negroes were most segre- 
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gated, that is. tlicy attended schools where their fellow students 
were of the same racial background as fhey were. In fact, more 
than 65 percent of all . Negro-students wereattcndingschools that. 

were between 90 to 100 percent Negro. This figure changed by 
-grade P to 66 percent who were attending schools that were 50 or 
more percent Negro. The same pattern tended to hold also lor 
teachcRS of Negro and white, although not as strongly. "On a na-. 
tioriwide basis where teachers and sturlents are not matched, the 
trend is in one direction: White teachers teach Negro children but 
Negro teachers seldom teach" white children." 

2 Do schools offer equal educational opinn t unities in : 
terms of input characteristics? White children were attending ele- 
mentary schools with a smaller average number o( children per 
room (29) than any of, the minorities Whoso cla.ss sizes ranged 
from 30 to 33. except in the non-raotropolitan North and West^ 
and Southwest where the pattern was reversed.' At the high 
.school level, the average was 22 white students per teacLcr and 26 
Negro students per teacher, but in .some regional areas such a.< the 
metropolitan Midwest, iho difference was far greater- 'he average 
white had a ratio of 1 teacher 33 students and the average Ne- 
gro had ! teacher to every 5- pupils per room! 

h, access t.o facilities, 'be Negro had fewer "(acilities that 
seem most related to academic achievement: they had less access 
to phvsic. chemistry, and language laboratories, lewer booRs pcr_ 
'pupil'in their school libraries, and less sufficient supplies of text- 
books Thev also had less access to curricular (college prep) and 
extracurricular (debating and school newspaper) programs that 
would appear to have a relationship to academic achieyemenL 
Moreover, the average Negro pupil tended to have less able 
teachers than did white pupils as a.sse.ssed by types of colleges at- 
tended vears of teaching experience, salary, and score on a 30- 
wo'rd vocabulary test. Th.e cUissmates or peer group ol the Ne- 
groes as compared with the whites were less often enrolled incol- 

■I' ..ever the W.St . tor t-xamplc. California. w;>.< not l>cft>( ,/<./,/«• ami ,/<•/". '''■'>;>;^;';*=;;;^ 
and o.h.r educational polices and practices that were .Lscruntnatory against ,ts 
minority groups (Hc-idrick. 1975). 
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lege prep curriculum and had taken fewer cour cs in English, 
mathematics, foreign language, and seieneei 

3. How much do stiulcnts learn as measured hy per/or- 
mance on standardized achievement tests?^ The Survey found 
that "with some exceptions— notably Oriental Americans— aver- 
age minor y pupils score distinctly lower on these tests at every 
level that the average white pupil." A difference was already pres- 
ent at the first grade level. On a standard scale where the average 
score r> 50 and the standard deviation is 1 0 (which means that 16 
percent are above 60 and 16 percent are below 40), the scores for 
Negro and white revealed these differences: . 

-| ^.^y^^ Median Sc(irL's 

Nc^ro White 

Nv^nvcrhitl . '^^-'^ . ^-^-^ 

Verbal 45.4 53.2 

At twelfth grade, the differences had increased: 

Usi> Medi;in Set>res 

Neurd While 



Nonverbal 40.9 .52.0 

Verbal 40.9 52.1 



Tests Metiian ores 

Negro White 



Reading 
Maifieinaties 

(ieneral Inlormatinn 4().< 



42.2 ' 51.9 
41 y 51.S 



Average ol the 
five tests 



41.1 . 52.0 



•The EflOS Survey used verhul ability as an index of achievcmeni. Some evalualors a- 
greed with this use (5fa verbal ability test. For example. Dyer (1972. p. 516) stated that a 
verbal abiluy test ''correlates so highly with measures of achievement in uading. mathe- 
matics and factual information tha'. it seivesa.sa useful surrogate for the measurement of 
other forms of learning.** Hov^evcr. verbal ability is also an index of general mental ability. 
As such, it is an index of both schooW;i?r/ext rase hool factors. Henee.it is not a.v/)('f/y/( nor 
a sensitive measure of school achievement. 
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In grade level equivalents, the gap between Negroes and whites 
showed a progressive increase with age and grade in school. In 
the metropolitan Northeast, the gap was about L6 years behind 
in grade 6, 2.4 in grade 9, and 3 J in grade 12, However, some mi- 
nority children, of course, performed better than many white 
children. For example, about 50 percent of the Negroes scored 
better than 16 percent of the whites. 

4. What is the relationship between achievement and 
school characteristics? a) When socioeconomic factors were tak- 
en into account, differences between schools accounted for only a 
small fraction of differences in pupil achievement, b) Variations 
in facilities and curriculums of the schools accounted for relative- 
ly htfle variation in pupil achievement or standardized tests. 
However, variations made more of a difference for minorities 
than they did for whites; for example, existence of science labora- 
tories showed a small but consistent relationship to achievement, 
c) Quality of teachers showed a progics^ 'vely stronger ijelation- 
ship at higher grades; more so to minority than to majority stu- 
dents. The greatest relationship was first between teacher's score 
on the verbal test and student achievement and second between 
the teacher's educational level and student achievement, d) Pupil 
achievement was also "strongly related to the educational back- 
grounds and aspirations of ether students in the school," The 
principal indices for these variables related to achievement, were 
proportion of homes having encyclopedias and proportion of 
students planning to go to college. 

After analyzing achievement .scores of children ola given 
lamily background who had been put in schools of different so- 
cial composition where the achievement was at quite different 
levels, the survey arrived at this conclusion: 

Thus . if a wnilc pupil from a home that is strongly and effectively sup- 
portive of education is put in a school where most pupils do not come 
from such homes, his achievement will he iittle different i han if he were 
in a SL-hool composed of others like himself. But ifa minority pupil from 
a home without much educational strength is put with schoolmates with 
strong edueational background, his achievement is likely to increase 

I he average white studcnt\s achievement seems to he less affected hy the 
strenuth or weaknesses of his schooPs facilities, curriculums. and tea.ch- 
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CIS than IS the average minority pupils. ...the inlcrence mii^ht llicn he 
made that iivpioviiiii the school of a minority pupil may increase his 
achievemem mon: than would improviuii the school ot a whUe Juki 
incrca.se his, 

A mechanism that could mediate such an effect upon 
achievement of minority pupils in integrated schools is lateral 
transmission of peer group values (Wilson,. 1963). This 
mechanism implies that minority pupils in integrated schools arc 
inlluenced through interactions with classmates from the 
majoritv group and would consequently tend to acquire and act 
.upon the values which underlie the achievement of majority 

Although the inferences of the Survey and the mechanism 
of lateral transmission of peer group values appear to be plausi- 
blL\ the inferences of the Survey were based upon symptomatic, 
not upon experimental evidence. However, about the same time 
that the Coleman Report was formulating its inferences, an event 
took place in Riverside. California that provided a natural-type 
situ/ition for experimentally testing some of them. ^ 

A Natiiralislic Experiment in Riverside, California 

In 1965 the Board of Education of the Riverside Unified 
School District acting unusually rapidly decided to desegregate 
its schools. Within s -n weeks, from September 3 to October 25. 
1965. the Fknird went from "initial minority protest^* to "appro\al 
of a detailed integration pjan^^ (Singer & Hendrick. 1967. p. 145). 

The Riverside Unified School District is a medium si/.ed 
school system with a total school population of 25.600 and a ipi- 
nority enrollment at the time of integration of 6.1 percent Black . 
10.7 percent Mexican American and 1.7 percent other minority 
groups. Integration in Riverside consisted of closingdown two of 
its three completely segregated schools and phasing out the third. 
Pupils from the segregated schools were then bussed to "receiving 
schools.^^ The school board\ plan called for minority enrollment 
in each school to approximate the same percentage as wa.s en- 
rolled in the district. With this decision on October 25. 1965. 'VA' 
facto se^rci^aiiou had been changed by virtue of board policy into 
de facto integration' (Singer & Hendrick. 1967. p. 145). 
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Although this integration polic-y was justified on the basis 
of moral, social, educational, and legal reasons, including a 
broad i'nterpretation the 1954 Supreme Court mandatefHen- 
drick, 1968), school uistrict personnel and university prolessors 
joined in a cooperative venture to evaluate the effects of integra- 
tion oil achievement and adjustment of the Anglo, Black, and 
Mexican American groups. However, this .eport focuses onlv on 
the reading achievement aspect ol this longitudinal investigation, 
includinu on onlv those st.iucr.ts vIk. were tested alter 
integiaiu'u 'uid occurred and who were then retcsted in grade3.9 
and again in grade 6.2.-' Only those students who were tested at 
each of these times were included in the "purified" longitudinal 
sample. Students who were not tested at each time or who had 
moved away were not included. The purified sample was then 
compared with a cross sectional sample whica had been tested in 
1965 and constituted a "baseline" control group. Although it 
would have been more desirable to have had a longitudinal con- 
' trol group that did not e.Kperience segregation, the 1 965 cross sec- 
tional group nevertheless appears to have been an, adec^uate con- 
trol group as indicated by the results of the e.Kperiip<-nt. B^th the 
longUudinal and the baseline control groups too^lcthe same Stan- 
ford Achievement Tests (SAT) in grades 1.9 and 3.9..but in giade 
6 ^ the cross .sectional group took the SATwhili^the longitudjinal 
group had the Comprehensive Test of Basic SkiHf The tests that 
were administered are shown in Table 1. \ 

\ 

Table 1 

Tests Administered to Longitudinal and Cross SecUonal Groups 



I ij .St;inriiril Acliicvcmcnl lest, I'riiiuiiy 1. l unii W 

. _•,'(, Stanliiicl Achievement lest. I'limiif;. II. Form X 

j,2 Sl.inlord Aehicvemcnl lest. Intermediate II. I "im W, 

(cross sectional groiipl and Oimprehensi\e lest ol 
Hasi^ Skills (loneitudinal grmip). 



Mncliul tiK-mlormaiH.n rcMCucdJierc has already been reported .n S.nuer I I'nd) a nd in 
Sinuer, Oerard. .iiul Kedlearn ( I'^Si. 
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The grade equivalent scores for the longitudinal and cross 
sectional groups in grades 1 and 3 were based on the same test and 
norm groups, and therefore are dircc ly comparable. But the lon- 
gitudinal and cross sectional group took different tests m grade 6, 
therefore, the grade equivalent scores for these groups are only 
comparable to the extent that the tests are highly correlated and 
the norm groups are equivalent. 



6.0 - 



5.0 - 
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4.0 



3.0 
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1.0 



Anglos 




i.y 



6.2 



3.9 
GRADE 

Fipure 1 Mean reading achievement for the longitudinal (solid lines) 
and cross sectional (dashed) samples of Anglos, Mexican-Americans, 
and Blacks.** 

; 4Sin^er H Gerard H & R^arn. D. Achievement. In H. Gerard & N. Miller (Eds ). 
permission. 
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The results of the reiid'^*^j^'''^K^r<;S ^'"'^ slif petwe,,*^^^ 
I. The differences at the third ''^ /\ ^f^'^^ '^^^'^ trols ij^^ tli^ 
longitudinal groups Und their ^f^/ ^ Kii*'^''"'^ c(7^nJng^^^*^ n^'' 
statistically bi^jnificant. Insteii*^' ior tht? rcsK^'in"' 

sar>;)lc and iheir respective "^'^'^^ ^'(iri""°' 8'"<'^i of cli^'^^b''-' 
e:.ch 0;i. r so cloSely that th^y '"^'J A^^^Hmt'^^ "^'-^ ^'''[i^P cil^'"^ 
the l%6grovipsas baseline cof^.^^" / '"V ' fn general- ' tetcriQ^^^ iH' 
dicate iliat integration had n'-''^ /' \vVat)le nor ^'^ievcn, t^'' 



dicate iliat integration had n'' ( '.V bat'''^ "orf jevcn^^** t^'' 

facts upon the rates ofdevelof?'^^''^ fCadiaga^ ot lo^^J^'^t 

the majority and minority gr^''^ w/V'\figlo'^ did por ^| >n 

chievement, as their parents 1^^^ V' V ^o"' had tti^ 



ts 1^^^ J" V ' 



was at or 



■ the ££05 Survey: Tlit-' ,V tv\l0 reading group 'htif" 
or slightly ahead of tht' ^^^y ^\\iilent norf^.rade .^hi''^ 
the Blacks and Mexican An> '"'ii'y''^ A ^bout tw^^periti^^lui^ ■ 
alent levels below the norm \iiral-type^j^c infe^^\t 

the Riverside desegregat'-- ^ j^^tS^PPor' '.^fd dej;*^^'^'-''^ 

of the Coleman Report ^|^port occU'^jie Coi^it^ ^ 



situation that should I: 'iyH.,'\v'""'''^'e t'^ JructiQ 

Report's inferences sin^c a> '""'/Wi''" ""^ rlass, ,^ tli^ 



:y to teach towards ir'' ^"/'NJ'' Ic^^' "''''^'^ .^tude, ^^u"^ 
imulated and P'-rb^^f J^^ ; H^K^j minority . ion q,!^ t"" 
higher achieve-^.-: ■i?"'^ V^^^gl traasm'^^jian' i,^ \iC^^ 
group values was n- ^' 'ikeiy ^° '^/'''^A |liversid^ cvcii i^.^tH'^'' 
places in the county where t''"'' •^/fv ' V ■'^'iiUi./^hC''' 

for even after integration tl'"''" o'' "^''-ior''' *^^s 



Riverside constituted SO p^fC^ ^^ ^ Jass/ 



MIowev. ■ policies or reducing schoul -^-'f \ -i'l^^^'' ^^'^^^ ^Vu^! in 

largest sc .u)l disiricis** in the UnlicU S^^' ..kiY ^^'"l i" ""''''"'^ ^ /^'i^ih 
vcars I9^.-I971 -whites with childrer, -ji K^^ ur^' •^'^^"'"^^ ^*''''h'^^^ (^^N-or^^""- 

iions ot blacks.- Although these f indiC' V J M\ ''^"^ ncvrrt^^.;|i^^^;l dis,^:;^l,nian 
( 1976. pp. to suggest that the f'' J VV*''" ^^'rg*-' ''"^^^Ms lj>"ls '''''' 

not iichieving its purpose ol' "incrcasin^^ c» ^K^Xm ^'^^""^ ''""^''"N H^'^'" 

ever, in the next to largest schooldistriu^^-* ' ^'h -^^k'^^ ^''•^ 

not related to "the extent o| descgrciii*^' <l?A,.. f ' ^^^'t^^^''-- '\ .Sni^-' 

(Jackson. 1976:i012rHcnccrTt app>'; ^''^ ^^'h " 

cntial utiect related to dislncl si/c .n, of^^^-h^.j^ 
iricls anil its intcniled citecl in siiuilj^.r " ^ ' ^^v^ 
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I'incs 11067) obscrvcci Bcrcitcr and r.ngclmanns pre- 
school in action. She reported that the children, in groups of live, 
were taught in the most direct manner possible. Kor example, m 
language development, the children were taught as though they 
weix learninu a foreign language; the child repeated verbatim the 
teacher s init'ial statement about an object.ora picture and then 
was expected to give a complete answer to a simple question 
about the object or picture. The program consisted oi three in- 
structional periods ol 20 minutes each, separated by hall hour re- 
cess periods lor singing and games. Hven during these recess peri- 
ods the children .sang specially composed .songs and played 
games that were designed to teach them the content that had been 
stressed liuring the instructional periods. 

Although Hereiter and Hngelmann's emphasis upon aca- 
demic content\it the preschool level was controversial lor its 
time Pines observed that their students seemed to enjoy the pro- 
gram and appeared to be making excellent progress toward 
their objectives. Her observations were correct: Bereiter and 
Fngclniann reported that a group of 15 educationally disadvan- 
taged children who had started in their preschool at a median age 
oM years. 6 months had achieved enough alter 9 months of in- 
strucUon to qualify in reading and arithmetic for entrance to lirst 

grade. . 

A similar tvpe of preschool at the University ol Chicago 
( Fowler 19^5) stressed general cognitive development. Children 
engaged in what appeared to them to be a game- consisting ol 
findim- and manipulating objects, but the game was actually de- 
signed^o develop counitive abilities, such as making discrimina- 
tions amonu objects and then abstracting and generalizing cer- 
tain eharacu-ristics of these objects. For example, the children 
vere instructed to find all the pictures which show people 
workinu in factories. For the development ol language ability 
the children participated in the formulation of stories narrated 
around objects and scenes manipulated by a storyteller. Mneeali 
objects were labeled and defined for the children, they also had an 
opportunity to develop their word perception along with their 
'vocabulary. 
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Tlic Bcreiicrand Hngclmann program can be caicgori/cd 
as "^//^d'cY ifistna tiour icacliing siuclonis what ihcv need to Icani. 
In contra^, the Chicago preschool program fits into the rubric 
of indirect instruction emphasi/.ing coy^nitive ch've/opmeni, 
which provides instruction aimed at improving cognitive pro- 
cesses and abilities. This training was presumably neccssarv for 
subsequent academic achievement. Both types cjf programs were 
subsequently given field trials and compared alongwith other 
models in a social and experimental educational program known 
as "Follow Through," which was one of the two largest federal 
experiments for exploring ajternative approaches for delivering 
educational and social services. 'I*he other, which preceded Fol- 
low Through and which had broader objectives than the pre- 
school, but was also for disadvantaged children, was known as 
"Project Head Start." 

Projoci Head Siun 

Even while desegregation was beginning as the sole eom- 
pensatory education approach, later to prove inadequate for at- 
taining the goal of equality of output, intervention programs 
were being initiated. The largest was the federally sponsored Proj- 
ect Head Start which began in 1965. It had the purpose of pro- 
viding compensatory preschool education for poor children so 
that'vchildren from low income families could enter school on an 
equal basis with their more economically fortunate peers. The 
hopCiWas that this type of intervention at the preschool level 
would, contribute not only toward equality, of educational op- 
portunity but also toward the disruption of the cycle of poverty. 
The objectives of the program were therefore comprehensive; 
they included provisions for the child, the child's family, and rela- 
tionships among the child, the family, and society. The provi- 
sions for the child were concerned with his or her health and 
physical abilities, mental processes and skills, expectations of 
success and confidence in future learning, relations with his or 
her family, attitude of responsibility toward society, and sense of 
dignity and self worth. The program also aimed to develop the 
ability of the child's family to relate in a positive way toward the . 
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child and 10 increase the (aniily's sense of dignity and sell worth. 
Lastly, the program was tb. provide opportunities for society to 
work with the poor on solving their problems.'' 

The eomprehcnsivencssV Project Head Start's objectives 
had fostered the development of a v^iriety of preschool models. In 
1969. the U.S. Office of Education initiated an evaluation ol the 
. models that had been well formuiat'ed and tested, and whose 
sponsors' were using the principles of Iheir models for imple- 
menting programs for elementary children in Project Follow 
Through. The purpose of the evaluation, which was known as the 
Head Start Planned Variation Study, was to provide information 
to communities on a) processes for implementing preschools. b) 
kind.^ of experiences provided in preschools. c) effects of different 
intervention programs on children and their families, d) compari- 
son of intervention effects in preschool vs. primary grades, and e) 
the advantage of continuing the same preschool strategy into the 

primary grades. 

Eight models met the criteria for inclusion in the evalua- 
tion. Table 2 contains brief descriptions of these models, which 
were grouped int Ahree categories on the basis of their stated 
objectives and proced\ires. These categories, determined by the 
Stanford Research Institute. which did the eva'nation. were de- 
fined bv Bissell (1973. pp. 69>7G).as 1) Prccn adrnu: . which em- 
phasizes development of preschool skills, such as numberand let- 
ter recognition, reading, writing, and instructional language, and 
uses sys^tematic reinforcement as an instructional technique: 2) 
Cof^nilive Discovery, which promotes growth in basic cognitive 
processes, such as categorizing, differentiating, abstracting, and 
inferring bv use of adult verbal accompaniment to children's 
sequenced exploration: and 3) Di.svovery. which encourages free 
exploration and sell-expression, and stresses the "child's sense of 
self-worth, trust in ailults and the world, and respect lor others." 



' A rcporl on this rr.'icu was ma.lc h> Hisscll ( l')7?). Mwcl. ot the micrma.ion I<h this sec- 
lion comes Irom licr rcptui " \ 

\\ sponsor is anyone Nvho .i.recicti a spcc.tic model, whether u was.i person, a group . a 
univcrsily. or a private corporation 
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F-acli ol the models was" then inipleineiited in two eombninities 
and evaluated at the end ol" uie year. 



Table 2 

Head Start Planned Variation Models'* 

I'rcacadcmic Programs 

\. Academically Oriental Preschool Moilei sponsored by Wesley 
Hccker and Siegfried Kngelmann. Universiiy of Oregon, Emphasis on 
learning skills in reading, ariilirnetie. and language ;hri>iigh strucliired 
drills and reinforcenieni leclmiqiies. Small groups of live in len cliildren\ 
organized into ability levels, leaehers present ehildrcn wiili paiierned \ 
learning materials, elieii eonstanl verbal responses, and provide rein- \ 
foreemenl for eorreet responses. \ 
2. /h'/iavior . \milv.\i\ A/oJW. sponsored by Don Bushell, University of ^ 
Kansas. Instruetion mi skills, sueh as reading and arithmetie. and so'eial 
skills through reinforeenient procedures wliieh feature a token system 
and individualized programmed instruction. The aim is to have children 
eventuallv learn a skill for its intrinsic reward, 

Cognitive Discovery Programs 

1. The /'loric/d rarent- Educator Model, sponsored by Ira CJordon, 
Universitv .of Florida. Involves parents and teachers as'educalors and 
thereby insures both home and school instruction. A mother from the 
local community is a parent-educator who works with parents in their 
homes and with teachers in school as a teacher's aids, The curriculum is 
based on Piagetian theories of cognitive development. 

2. Tucson Early /-ducaiion Model, designed by Marie Hughes, 
sponsored by the University of .Arizona. Concentrates on language 
competencies, especially the transition from Spanish to English for 
Me.\ican-.\merican children, .Allows children freedom to choose 
activities, (osters cooperation among children, and uses systematic 
positive reinforcement from teachers in order to promote intellectual, 
motivational, and social skills. 

3. The Responsive Model, designed and sponsored by Glen Nimnicht. 
I-ar West Laboratory for Educational Research and Development.' 
Curriculum stresses "development of a positive self-image and 
intellectual abilities through self-pacing and self-rewarding materials' 
known as a responsive-environment. Materials of instruction consist of 
problem-solving skills, sensory, discrimination, and language ability. 
Children freely explore and make discoveries in a structured 
classroom. 



Miisscll. J.S, Planned variation in Hciui Start and Folhw T hrough. In.I. C. Stanley (Hd,). 
Compensutorv cdiuumm for children, a^c.s 2 n> S. Baltimore: Ihe .lohns Hopkins 
I niversitv Pres.s. i^7}. Reprinted v^ith modifications by permission. 
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4 The Co.niinr M'L hy l):.vicl Wcik.rt. High Scope hduoaiumal 
Research l-OLimlaiion. CLirriculum is based on iheones of 1 .agei. 
lea rs nu,ke decision and plan detailed lessons and acnmes. 
Classroon, supervisors assist teachers, H.une tratnniy scsstons with 
mothers are also stressed. 



Discover) I'royranis 



I .1 rru,>,wlu-Orienu:l Mn.lel. sponsored by the l-.ducatiot, 1 )eye op- 
nient Center in Newton. Massachusetts. An advisory atul eonsultan 
Ivsten. v^hich emphasizes self dcveloptneat and ■n'''^ "'^"'' , ^ 
s'tvles lor teachers and students, leachers areenconiayed to c.xpen.nent 
uith wavs or losteriny self rc.pcct. tespcct lor others, nnagtna uon. 
"Iriostty. persistence, openness to chanyo. and abduy to challenge 

'^'ne Hank Slrcc, lolh-^r developed and sponsored by ihc 

Ha, Street College . 1 dueation in New York Ci'v. I-.n^P asts on eac , 
child beeonnng sell-directcd in his or her learnuig. U-ach rs act 
consistentlv as adults clnldren can trust: develop children s scnstttMlics 
to smhts. sounds, and ideas: and lostcr children s pos,t.\c images ol 
.hcnisclves as learners, As ni ttaditional tntddle class nurserv schools, 
children un.iate ac.i^■.,cs. such as cook.ng. block-butUhng. and soco- 

However the evaluation itself had its limitations; Random as- 
signment of models to eommunities and children to modeLs and 
control uroiips could not be done: conseciuently. the design was 
not a ckissical experimental design, but had all ^^^^J^^'l^ ^' 
uuasi-e.xperimental design (Campbell c*;: Stanley. 1966). Other 
factors also limited the evaluation: D The attrition rate was as 
hiuh as 50 percent in some sites. 2) [ he measurement instruments 
focused primarily on cognitive achievement because instru- 
ments that were reliable and valid were available for this out- 
come: consequently objectives in other domains were assessed by 
less reliable or valid instruments or were not assessed at all; and 
even for cognitive achievement, the instruments were weighted 
for asscssin« ucneral abilitv in comparison with general and spe- 
cific achicvenment. 3)Other methodological problems also oc- 
curred in the studv. such as limitations in the instrument used 
(tloor and ceilin^ effects): questionable test interpretation proce- 
dures for example, using the child instead of the classroom as the 
unit ofanalvsis: and inadequate adjustment for iTiitialditjci^ccs 
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between experimental and eontrol i^rDups. lor all tliese reasons, 
the results of tlie evaluation eould only be eonsidered lor the 
formulation of hypotheses to be tested subsequently in rigorous 
.small scale experiments rat her than as evidence on which definite " 
conclusions could be reached (Bissell, 1973; Kennedy, 1978).^ 

The sample of children consisted of L569in Planned Var- 
iation C^lasses (the experimental i^roup) and L()7S in ^Yegular^^ 
Head Start comparison classes (control group). The children 
ranged in age from } to 6.5 years, but 72 percent were between 4. 5 
ami 5.5 at the beginning of the Head Start Program in the fall., 
riie ethnic sample in percentages were lilack, 55: White, 25; 
American Indian, 2; Puerto Rican, 2; and Mexican-American, 1.' 
Forty-three percent of the parents had attended onlv grade 
school and 49.5 percent had attended school only to high' school. 
Although the experimental and control groups were coniparabic 
on demograpivic criteria, marked variation-still occurred within 
specific sites. Thi> variation may not have been well controlled in 
the analysis of tlic data. ... 

On the degree of implementation of the models, teachers^ 
in the Pr^?academic model rated highest, followed by those in ' 
( ognitivc Discovery, and then in ihe Discovery models. Therea- . 
son lor the variation was tiot attributable to'ycars of teaclinig 
experience, but to the difficulty in learning to implement i-jevvA 
types of instructional models. Nevertheless, teachers "learned to ^ 
teach and emphasize their modelsVareas of concerns. Kor exam- 
ple, the Preacademic model showed greatest frequency of num- 
bers, letters, and language training activities: Discovery models 
revealed greatest degree of occurrence ol so... studies activities 
and instruction in^colors, sizes and shapes: Discovery programs 
indicated highest amouni of role playing activities (doll play and 

lantasy). However, regular (control) classes also cmplia^i/ed 

cognitive training, which would ol course, .-dimirrrslf^^^^^ 
between the rcg^ular ehLs^es'anc■r^^^^^^ model classes in this area of 
insiruction:"^ ' 

Similar prnhlcms vscrc I.) plauuc the f'nilcw I hr„uuh rvaluation as wc shall sec in the 
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Ihc results of ihc comparison between the nuulel iuul 
regular classes imiicated that the Preacaclemic and Coiznitive 
Discovery models made the lartiest yains on academic achieve- 
ment (a combination of six subtests of the New York University 
l-ariy Childhood Inventory which assess knowledtic related to 
specific areas of science, mathematics, letters, and numerals) and 
general coiinitive development (the Preschool linentory and the 
Stanford Hinet IntelllLicnee Scale). However, only 10 Preaca- 
demic Y'ii^ses. S (/ognitive Discovery, and 2 Discovery classes 
participated in this evaluation. Although chilchcn^ achievement 
and cognitive development were positively correlated with the 
/level of academic background and practical experience of then* 
teachers, the same relationship did not occur in the model 
teachers' backgrounds which suggested that theinservice training 
contluctecl by the sponsors made up for these variations among 
iheir teachers. 

On the Hert/ig-Birch measure of styles of eoping with 
cognitive demands, which assesses the way the child responds to 
tasks on the Stanford-Binet. the results suggested that children in 
the models programs had learned tQ engage in task specific activi- 
ties and to decrease irrelevant verbal and nonverbal, responses 
when asked to do a cognitive task. In short, the children in the 
models programs seemed to learn to express only correct re- 
spon.ses. This type of learning was eniphasi/ed in the Prcacadem- 
ic programs, which taught the child to respond only with corrcct 
verbal responses. Consequently it is understandable vvhy children 
in the Preacademie pi ogranis made the greatest gains in not re- 
—r/.sponding whenever they apparently could not do the task. 

On mother-child interaction, which involved assessing 
mothers on teaching their children to sort eight blocks and the 
chiUfs success in doing so and then explaining the basis for the 
sorting, the greatest gains were made by the C\)gnitive Discovery 
and Preacademie classes. Maternal use of praise was highest and 
children^ success greatest in the Preacademie classes. Again, the 
differential gains among the programs reflected the relative em- 
phases among the models. 
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When interviewed, [nirenls in regular programs fell thai 
regular Mead Slai l elasses provideci.iheni primarily v\ iili baby sil- 
ling and day eare for ilieir ehrlclren. In eontrasi. paicnts of Pre- 
aeademie programs stressed aeademie and learning perfurmanee. 
In other models, parents stressed interrelationships among ehil- 
dren ami teaehers. Again, the results tended to relleet dilTerential 
goals of the models. ^ 

In general. Hissell(1973) eoneiudes that the findings sug- 
gest an equality ofvjfccispauciiu that is. ehildren tended to have 
equal gains in LOgniti\e performanec and aeademie aehievement in 
all well-implemented classes. But Hisscllalso found that a pattern 
o\ specific cjTecis had emerged: ehanges in ehildren tended to be 
eonsistent with orientation of the program in which the children 
had participated: these specific effects were greatest on 1) 
achievement and cognitive measures and 2) cognitive response 
st\ les. 

Do gains matie in preschool programs persist through the 
primary grades? Citing studies by Bellcr { 1969). Engclmann 
(1970). Hereiter (1972). and Weikart (1969). Bissell { 1 973. p. 
65) concluded that "poor children who had had preschool 
experiences perform appro.ximately on a paDwith their peers who 
have not had preschool education, and both groups perform 
below the national norms." I'wo explanations were given for this 
phenomenon. Ihcy can be called; 1 ) I'hc Icvclini^ off hypothesis: 
Icachers do not build upon gains children made in preschool but 
instead have them repeat the same curriculum; consequcntlv 
other children have the opportunity to catch up. 2) Vhc spcct'/icitv 
hypothesis. Preschool programs only develop functions that are 
specific to preschool^Acars and these functions do not result in 
impro\ement in later years. 

However, the failure to maintain gains made in preschool 
is not e\elusi\e lor ehildren from poor and predominantly black 
t ami lies but also from other socioeconomic strata and from 
Caucasian- families as well. For example. Durkin (1974^1975) 
started a voluntary two-year language arts program for/37 four 
year olds, 16 girls and 21 boys, in a small mid western to\vn near 
Champaign. Illinois, which had some lower and middle class 
fan.ilres. but also had a "preponderance of upper-lo\\\*r class 
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families;* All children were white, exeepl for one boy wlu) was 
black. Ihe mean IQ o( the grt)iip on the Sianl\)rd Hinci 
IniclliLiciiec'resi was 1 13.6. Al the end o[' iwo years the average 
iiinnber ol" words iliai llie 33 sludenls wht) had CDniinued in ihe 
program could corrccily ideniily was 125: lliey could also name 
on ihe a\erage 47 Idlers (upper and lower case) anil iliey knew 
st)unds lor 16 Idlers. The conirol group Cninsisicd ol 26 boys and 
23 girls, all Caucasian classniales, who had nol allended ihe 
preschool bui who had allended kindergarien. The control group 
could ideniily only 1^ words and name 41 Icticrs al ihe end ofihe 
kindergarien year. In grades 1 and 2 ihe experimenial group was 
signilicanily above iliCconirol group. Hui in grades 3 and 4. ihe 
dilrcrcncc belween llie experimenial and conirol groups on llie 
(iaies-MacC;iniiie reading achievement lesi was no longer 
statistically signilicani. Durkin (1974-1975, p. 59). implied thai 
ihe con\ergence between the two groups had occurred because 
nol all laculiy members nor the atlminisiraiors ol the school were 
•^committed to a program in which instruction is matched to 
children's current achievement. . . 

■ - At this point, wc want to present M*e results ol ai;t)lher 
aiicmpi to determine whether intensive instructitin through a 
models apprt)ach in the primary grades would enable children 
wlu) had made signiricani gains in Head Start to maintain their 
inci eased rate ofdev ekipnicnt in the primary grades and whether 
such intcrventit)n wt)uld 'enable children Irom poor families to 
overcome their cducatit)nXl disad\ antagedness. Some studies, 
for example. Karnes et al. (\969. 1973). had demonstrated that 
such a continuatit)!! would bc'beneficial. Howcmm*. in this study, 
known as I*rt)icct lu)llow Ihrough. the models were not only 
compared with each t»ther. as had been done in the Head Start 
study, but also with control groups taught by regular public 
school teachers. .A major question then was whether the models 
approach with experts pro\iding inservice training, curricular 

."Mn\\c\cr . prvcDLions i\MJcrs ulul^c mcili.in \l) \\a^ i: "! wcrv Miini!ic;inil> ahtuca 
c<»nirol iirniip oi) rcailini:.KhK-. L'ir.cnnSianl.>lil Intcrmaiiaic KcaJitJU Ic^t)aI ihccfuidl 
sixth grade pcrhap^ hccali^c ihc precocious readers* moihers who had played a signilicani 
insiriiciional r(»le in ihcir children s achievemeni were still exertiny such an inlluencc 
(Durkin. i 
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nialciials. uiul stall help would accelerate ecliicatii)nally disad- 
\antaized .stuclents' development. 

« 

Project Follow Throuyih 

. Project Fh)11o\v rhroiigh// began as a pilot study in 1967. 
It was authorized under the Economic Opportunity Act. PL 90- 
92. The OlTice of Keonomic Opportunity then delegated its 
authority lor operating the project to the U.S. OlTice of 
lalucation. Although Follow Through was primarily an 
educational program, it also provided comprehensive services, 
including medical, dental, niuritional. social service, guidance, 
arid psychological services to the project's students. Further- 
more, parents participated in the program. Consequently, staff 
members in the project worked not only with schools but also 
with health, welfare, and social service agencies located in the 
community. The aim of the project was not just to improve 
academic .achievement but also the child.'s *Tife chances." 
Consequently the project also emphasized other aspects of child 
development including development, of "confidence, initiative, 
autonomy, task persistence, and health'' (Bissell. 1973, p, 88). 

T*hc. main Follow Through Project began with a meeting ' 
in Kansas City in 1968. School district officers who had been 
invited to the meeting heard descriptions of each model and 
selected sponsors. A list of the sponsors' models, categorized into 
three groups according to their major emphases, is in Table 3. 



Dcseripiions of Categories and Li.st.s of Models 
in Fiach Category'" 

I. I^asicSki// Moilds ■ ' 

I ho tm>dcls in this calcgius" \scrc: 

I^ircL'l Instruct inn. I'nis crsity nl Oregon 

Uoliavior Anal\'sis. University ol Kansas 

"In rcpDriing on I*rojcct hollow rhmiigh. wc have drawn primarily on the lour volume" 
report, pariicularly Volume IV .\ by Siebbins. St. Pierre. Proper. Anderson, and Cerva 



• I'd lo dtlfeleiii lalc I he pioieci iianic I rotu the ucnerie luitne b> w mini: I he project name 
^^nh c;tpital letters , iti the intlial posnioti ol eaeh \wtril and the generre nanic^uiih Ujsut 
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I.aniiiiay:^ IV\clopincm (Bilin):ual Approach). Southwost 

I'diK'atiDiial Dcvclnpnicni l aboratois 
C alitoriua Pro».•L'^s, Calil\)rnia Stale Dcpatiincni nl iducaiion- 

DiviMon ol C^)tn()cn^a'^o^v l-ducation 

I hcsc imnlcls ^^K•u^ primarily DM itiNM ikiiDn in \ ocahulars , ariihincli.-. 
coinpmaiion. spelling:, and lan|!iia^c ^kills. l-our MlhtL•st^ ot (lie 
Mdropoliian Achicscinciil IcM were unccI lo aNNCss I'olUnv Ilirouiih 
clTccis in these basic skills. The assuinption of these models is thai the 
best way to obtain educational el'lects is throuiih direei insirnciion, that 
is. teachers arc^ to leach ami have slndcniN praciiec whai ihev uani 
simlcnts lo learn. 

2. Ci>\ifunvc-Ci>ncc(nual \ttHlcls 

I he model> in lhi> caiei»ory were: 

I'lorida Parciii I'dueaiion. llni\cr>il> ot hlorida 

I iicson I'dncaiion Model. Arizona Center tor I*arly Childhood 

I'dncalion 
Cognitive C'lirriculinn 

I heNe nuuIeK einpluiNi/c •'learnint:, how lo learn** and problem-solving 
>kills. !c^l^ lor assessing these objcelives are l<a\en*s Proi:res>ive 
Matrices .md Metropolitan Aehievemeni lesis: Maihemaiical Con- 
cepts. Maiheniaiics l^roblem Solvnii:. and Reading: Comprehension. 
I he assuiMpiion underlying: this model is thai irsiudenis learn how lo 
learn ami how lo soke problems, ihey can apply these proeesses to 
acquisition ot knowledtie in all content areas. 

3. At'i'ciiiyc Out conn's .\/(>(h'l\ 
Ihe moilels in this catciiory wore: 

Matheinaiicnic Activities I'lograin. University of (ieorgia 
Responsive luiueation .Model. I*ar West laboratory lor 

I'llucjitional Research and Deselopmcnt 
Interdependent l.car' in^;. City liriiversiis of Ne\s VL)rk. 

institute lor' Desclopmental Studies 
Hank Street College ol ilducation .Approach. Mank 'Street 
■ College / 

/•-VX* Open I'ducation I-ollow Ihrough, Tducation I)e\clop> 
nient Center 

Ihesc models toeus primarily on development ot a positive self concept 
and fas orablc atlitiules towards learning, and secondaril> on 
development of cognitive, conceptual skills, (he Coopersmith Selt- 
I'stecm Inventory and the Intellectual Acinexement Respcuisiblhty 
Scale measure/ aspects of the affective domain. I he assumption 
umlerlying these models is that if students accept and feel positive about 
theniseUes. thes will be able to desote their attemion to the external 
vsorld t>i '-'arning ami achie\enient. 
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"["able 4 ctuitains a more detailed description of the 
models. 

Table 4 

Fallow Ihrough Models'* 

fjiisic Skill s .\finh'l\ ( Sfnu turcd AcadcniU- Apprtwcht's ) 

I licsc approaches place hea\ \ emphasis on icavhint; acailemic 
skills and concepis \s ilhin the classrooni ihrouLih prngraninied instrue- 
lii>nal lechniques. As in ihe Head Slari l^reacadcmlr nu)dels. each nt 
ihese apprc)aehes u^^ s an analysis i)f ihe C()nip.>nenis uhich makes up 
desired behavioral Dhieeiives lo iiuide a caresm >ei]uencini! of learninii 
e.\[)erienees and a consislenl use ol'.exiernal reinl'orcemenl. Highly 
siriiciured educarional en\ ironmenis are useeJ-by all ihe>< sponsors lo 
"engineer** acceleraied rales of learning, alihoiiiih. ihc\ vary among 
iheniseKes in ihe specific curriculum conienl, in Ihc degree of Individ u.- 
ali/ed learning, in ihe respccli\e roles. plnyed by leaehers, parenis, and 
nialerials. and in ihe emphasis placed on -the child's iniliaiive and auion- 
om\. I he li\e approaches in ihis group are: 

1. fit'havitfr Analysis Sftnli'l. sponsored b\ Ddu Bushell, Jr., ,Suppori 
and I)e\elopnienl Ccnler for Follow I hrough, Fni\ ersily of Kansas. In 
ihis approach, leachers use a loken syslem ol' posiiive reinforcemcnl 
and indi\ iduali/ed programmed niaierials lo leach social skills (such a.s 
laking Ihe role of ihe sludenl) and academic skills in ihe areas o! lan- 
guage, reading, wriiing. and maiheniaiics; parents are hired lo work in 
ihe classro(mi alongside leachers as behavior modifiers and luiors. 
1. Iniliviiliiiilly /trcM rihcil iti'^trm fmn atnl (he /frifnaf v rduufion pri)- 
/n f (ff 'l.f*), sponsored by I auren Resnick and Warren Shepler. Learn- 
ing Research and Dexelopmeni C'enier. l;ni\ersiiy of Piilsburgh. These 
approaches proxide an individualized, sequenced program of insirue- 
lion for each child which loaches him academic skills and concepts in 
the areas of language, perceptual motor mastery, ciassii'icaiion. and rea- 
soning. Diagnostic tests determine each child's strengths and weak- 
nesses and are used by the teacher to prescribe instructional nuUerials; 
positive reinforcetnenl is given continually for success in learninLi. 
3. /"//<' latjji^tidi^t' dvvciopnictif ■ bilingual cchicufion approacli, spon- 
sored b> Juan I uian. .Southwest Iiducational [development Laborato- 
ry. This approach vsas originally designed to meet the educational needs 
of poor .Spanish-speaking children (it is currently being adapted for use 
with French and other non-English-speaking children as well) and 
teaches mathematics, science, and social studies in the children's native 

"In Bissel!. J..S. Planned variation in Head Start and Follow Through. In J.C, Stanley 
(Ed.). Cdmpensaiory eiiucatinn for ^ hildrvth ai^cs 2 inH. Baltimore: The Johns Hopkins 
I'nivcrsity Press. 1973. pp. 90-93. Reprinted by permi.s.sion. 
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hiMLiiiagc. while sinuilianctuisly leaching: I-imliNli nccomi* hiniiiiaLic: 
its riRihcHis inc liulc cMcnsix c ii.sc of slr uciurcd drill techniques, reliance 
oti tnaterlals rele^anI to the children's native backiiround and experi- 
ences, aiul iie\e!i)pnient ol oral languaiic prior to written lariLiiiaLic. 

4. Ihc rfiathema^cnir activities proiiram. sponsored by Charles Smock, 
Sehool ol Hducaiion. University oKicorgia. Of central impi>riance to 
this appriKich is'ihc ctiiphasis (Mi childrenS IcarninLi-by-iKntfu in a se- 
quentially vlructurcil ciuu-onincnt dcNiiined to teach skills and concepts 
in niatheniatics, laiiuuaue, science, social studies, an, nuisie, atul physi- 
cal eiliicairon: i hildreti leai n through se!f-inuiaied, nuiueli^ e soK inii ol 
problems which are litiely sctjucnecd to assure both adxances in utuler- 
statiilini! and a hiuh lc\el i>f piisiii\e reinforcetnent. 

5, Direct Instruction, sponsored by Siegfried Hngelmann and Wesley 
Becker, Department of Social lidueaiion. University of Oregi^n. The 
priniar\ toeus ol this prouram is on proniolini: skills and concepts es- 
sential to reading, arithmetie, ami language achie\ chienl through .strue- 
liired rapul-tire drdls ami remftMccmem techniques using rew;u\l.s ami 
praise to encourage ilesired patterns ol hcha\iiM-: small stud> 
groups ot Ine to ten ehiklren are i>rgaiii/ed b> teachers according 
tn abilit\ le\els in onier to lacditaie presentation ot patterned learning 
niaienats ;uul to elicit consianl Verbal re.sp.onses Irotn children. 

C'itmcjuual Coi^ni/jyc Mot/ch f Ih.\c(nvrv Af){yri>ay hc\ ) 

1 he basic goal i>l the !)iseo\ery approach is to promote the 
develop tnenl o! autonomous, selt-contident learning processes in 
ehildren, rather than simply transmitting specific knowledge and skills. 
•Mlhouiih, like the- Structural-Academic approaches, they locus (Mi 
childretrs classroom experiences, their emphasis i.s on teaching a 
programmed seqi'cnec o\ materials, but rather (in pronuMing 
e.xplor.ition and tliseo\cr\ in an en\ ironmeni w fiich is rcsponsi\e to the 
eluldreirs own jtiitiaiixc. llea\> emphasis is placed on intrinsic 
motualion and the gratilt'. ation ehildren deri\c from niasier> itsell. 
C'ogniine growth is seen as only one eoMiponeiii iif the child's ego 
de\ elopnient. inseparable from a posin\ e sell-C(nieepl, curiosity, 
independence, and tlie ability to e(n)perale with inher.s. 1 he ihrec 
Discovc! . approaches are: 

1. Ihc Ihmk Street Cullc^e minlcL sp(m.sorcd by F-li/abeth Ciilkeson, 
Bank Street College (d' luiueaii(^n. By funciicminu as ec^nsisieni adults 
ibal children can trust, by being responsive to individual children's 
needs, and by sensili/ing them to sights, sounds, feelings, and ideas. 
Bank Street teachers help children build posiii\c iniag - of ihcniselves 
as learners: they introduce themes of study and play 'fclcvani to class- 
room life, encourage children i(^ explore various media, support chil- 
dren's making of choices and carrying out plans and help ihem use lan- 
guage to formuhiie ideas and feelings in order to promote self-confi- 
dence. cn\ironmenial mastery, and language expressiveness. 
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1. The fulut tifion Ih'vt'Iofnnrni Crnii't nunh'L sponsoicii by CicDt j-v 
llcin, I-ilucalfi>n I)c\i''i)pmcnl CciUcr. i hi- ijpproiich liishiDtis c1,!n>- 
liUMU cm irDMincnts icspDii^ivc in the iiulisidiKil nrcils ami sl\lcs o\ 
chiulrcM ami teachers in aecDrdanee u itii the V-open cia.ssiDonr'cDnccpl. 
\\hieh has re\ Diiitioni/c^i l^ritish primary schools i>\er the last se\er;ii 
\e.ars; ii is an a(J\isory ami ciMisiiltant system which ciicouraucs schools 
ami leachers to f\pcrimem \\ithMiversca\ cnucs ior I o.stcr in children's 
sell -respect, rcpeci lor others, imagination. curiosit\ . persistence. 
openncsN to chanijc. ami ability to challenge ideas. 
.V Thr rt'\f\/n^i\t' cnvironnwni nualcL sponsored b\ (ilen Nimnichl. ^ 
f ar West 1 aborator\ lor I'ducational Research and l)e\elopnient.ln 
this approach, chikhen are Iree to set their ou n Icartnng pace and t»^ ex- 
plore the elassri>oiii en\ iroiitiK*ni. which is arranged to facilitate mter- 
connected diseo\ cries about the physical cn\ironmcnt and the siK*lnl 
uoi ki. The luo primar\ ubicetives. helping chililren de\ clop a po.sii:ve 
scU'-itiiage. anil promoting their intellectual ability, are achie\tvl 
through use til sell-correcting gjmcs and equipment uhich emphasize 
problcm-siih ing skills. sensj)ry discrimination^ and language ability. - 
ami uhich pro\ide ininicdiate lecdbaek ami cnioyment from learning 
itselt. 

.M'h'\ t\w f >n/t <*fnc\ / ( 'o.ijMfVn \ ' Ois^^ vwrv Apf^nnichrs ) 

The C'ogniti\c 1 )isco\ery approaches a re less syster.uitically sim- 
dar lo one another than those ineithcrthc Structured Academicor His- 
eo\er\ groups. In general lhc\ promote the growth of basic cogniti\c 
processes, such as reasoning, classifyitig. and ciUinting. through hi>;hH- 
tlireeleti leaching of specific acailcniie skills, through children's lunono- 
nuMis tiiseiner\'. and thnuighconstant engagcruent i^f children in \erbal 
activities. Proponents of these approaches share a willingness to be ec- 
lectic and to include iii\erse program elements in their curricula. I he 
Uiur approaches in this group are: 

1. Coiinifivt'lv onrfUcd currunhifti ttunU'l. spDiisored by na\td Wei- 
kart. High Scope I^duealional Kescareh |-oundation. I)eri\cd fnuu tlie 
theories »>l l*iaget. this nuulel fosters chiliircn's understanding of fiv.* in- 

■ tellcctual tliur.ain.s (classification, numbers, causality, tinie.'and space) 
fhrough experimentation, exploration, and constant \ erbali/ation on 
thr part ii>f the children, through planning oi'dctailcd lessons on the part 
ot the teachers, and through extensive observation and assistance on tlic 
part 1^1 super\isors. A home-teaching prograru provides an oppor- 
tuiut\ tor pa're.nts to beconie ilireetly in\ol\ed in the cilucation of their 
cfiildren. '■ 

2. /"//<' I'lurulii f>tjrrnf'CiIm af(ir nunlfi sponsored by Ira (iordiMi. I'ni- 
\ersit\ ol f lorida. In adtlition loprm iding vva\s to improve classrooru 
organization and teaching patterns, this nuuicl trains parents to supcr- 
\ ISC learning tasks in the home in order ti> increase their children's intel- 
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Icctiiiil. personal, ami social conipcicncc. A key element in ihe pro^iram 
is luuni: moihers i)! l-ollou 1 hroui^M eliiklren as parent-edueatois who 
(mielicMi as teacher a u)es in the cliissrooni ami uho uoi k with other 
mothers in theii luMues. I he eni rieuhmi is !le\ible ami \ariesaeeorilii^i^ 
to the needs ol particular indis idiials ami classes, but there is an or ienla- 
tion touan! the theories ol l^iauct. 

}. flic inwrilciH-fulcni learner nttnU'l! sponsoreil hy Don Wolte, New 
York University. In this motlel. Icarnini; occurs principally in structured 
small-Lzroup instructional **i;ames** whe^c children ofdincrcm ability 
levels teach one another and become relati\el\ independent ol' the 
teacher. Theverbal transactions between cfiildren. wfiich are implicit in 
the process, are a direct stimulus to lan^^uai^e development: experiences 
in phonic blendiniz and decoilin;- skills stimulate readiniz ability; and 
l;ini!uai:e-niaih-logie iiames such as C'uiseiuiire rotis :uul matrix boards 
pr(>moie maihemai ical understandini;. 

4 flu' lui Mtfi t'dili ciliu (ilion nioilcl, spowsDvcd b\ Joseph hillerup, 
[ 'm\ersii\ ot Arizona. Major object rves ol this model are to promote 
iariiiuaiic Lompetence. rriielleci ual skills neccssarv lor learnini: {e.i:.. the 
ahrlil> t(» attend, recall, and oryani/e). positive attitudes toward school 
and learmni!. and skills in particular subjcci areas, such as leadini: and 
matliemat ic>, and in social interaction. Methods ernphasi/e iruli\ idiral- 
i/ed expericrKCs ami interest a> uell a** the treijueni use o( positive 
reintorcenient h\ teachers 

The result ol' the meeting was th:u eventu:illy :i total of 170 eom- 
munities and 75.()()() low income ehildren were served on an annu- 
al budget orS59.()()().()()() provided by Congress. 

The sponsors and the sehool districts eaeh had responsi- 
bilities for Projeet Follow M . '.).gh. Eaeh sponsor was to a) pro- 
vide a well-defined theorei .» Model, b) give eontinuous techni- 
eal\assistanee and insure implementation ofthe model, e) observe 
pro'gress. d) maintain the model's objeetives. and e) partieipate in 
evaluation of the mo'deL In turn, the sehool districts were to a) 
ehoose a program sponsor, b) select ehildren for Follow Through 
who had been in Head Start or similar programs, e) contribute 
some funds from Title I to the program, d) invoUe parents and 
the community in program planning, and c) engage teachers and 
school staff in the project. 

Alter an initial ,starting up phase which allowed time. for 
the models to become ihte'grated into school seltinrs. an 

'■^Sclf-sp (in sored and parent implemented approaches are not included in theanalysis in 
this report; hence they are not described here. I'or a description of them, see Bissell(|97.1). 
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c\aluatit>n plan was initiated in 1971 that used sajaplcs of 
students who had started school in 1970 and 1971 and continued 
through thethird grade Table 5 lists the grades and years of the 
Project^s evaluation study. 



Table 5 

Ciradc Level of Follow Through Children 
in Each Cohort and Stream, by Year>^ 
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To make the evaluation fair, tests were selected that reflected the 
effects of models in each category. The tests that were used arc 
listed in Table 6. 

The design of the evaluation was to compare Follow 
Through Students with a matched control group of non-Follow 
Through students in kindergarten through grade 3 over the years 
1969-1975 for three successive groups of children. But the Follow 
Through and non-Follow Through groups did not match in eco- 
nomic disadvantage. The median income in 1 970 dollars of the 
Follow Through groups was only $4000 while the control group 
was S6000. 



C c.nn.iLts loi {lKH-\ahi.ition vvt'tit lo Siatihml Kcst-.tivh Insliluic Um ihila colicclion and 
m \HI AssoLi.iics lot data ;iiial\sis. lliir(Mi Insliiulc cotisiihal lUi iIk* c\aluatu>n. 
llouL-tci. the ; S OMiLC ol I Jtication uas tvsponsiblc.lor ihc linal (icsii:n oi ihc 
t.'\ahiaii«»n. 

"-Sicbbins. I..B. ci al. An cvaluaiion of Follow Through. Vol. IV A. In R.B. Anderson 
(IVojcct Director). iUUicatifm as expvr inn-nun urn: A pUmnvii variaiUm (Final 
Report). I'.S. Office of Hducatioti. 1977. 
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Table 6 

Instruments used in Assessing 
Objectives in Ciradc,s K-3 



Basir Skiitv 

Wide Range Aehieverueril I'esi ♦ 
l>caboiiy Piciure Vneabulary, * 
C"aUlwell I're>clu>ol inventors * 
Meiropolilan Aehicvenietii Test (MAI) 

l istening tor Sounds 
MAl-r'riniary I or il or i- lententary** 

\Vof;d Knowledge 

Spelling 

Math C onipmaiion 
Language Arts (A and H) 

~M A 1 -Primer**. Primary I. II. I'lemenlary 

Reading 

Miilh Concepts 

Math Problem Solving 

Ravens Coloured Prtigressixc M.alnees 
-Mlective Skilj s 

Inlelleciual Aehieveinent Responsibility Scale 

Locus oJ C'onirol (positive) 

Locus oi C t>ntrol (negative) 

Absence from Scliool 
(*o(.persnnth Seli' Lsteeni Inventory 



pjiteritig Leaving 

:ror 1 'K' 1* 



•Measures used only tor the third successive group. C\>liort 111 who smarted 

kinderuartefj (I I IK). , , „ n 

••MAl-Primer was given al end ol kinderiiarien. Primary I in grade 1. I nmarv 11 

grade 2, and Llementary in grade .V 



■ The experiment also had other limitations: 
• 1 5pm//n7>'o/rr^a/m^mAa) Changes continued to occur 
in models^ objectives and goals a^pd b) Uiere was no adequate- 
assessment of degree of implemenUtion of each model 

2. Measurement battery. Basic skills domain was repre- 
sented better in . the - measurement battery than were the 
conceptual and affective categories ^ models; and 

3. Weakness in design, a) No random assignment to treat- 
ment of subjects or sites and b) the Follow Through and non-Fol- 
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low Througli groups were not matehed on the economic criterion. 
However, the failure in matching favored the control group and 
therefore hiased the results against the experimental group (Fol- 
low Through). 

The ethnic distribution of the students was markedly 
skewed. Blacks and F'uerto Ricans were predominantly in li rgc 
cities. Mexican-American pupils were concentrated in the West, 
and white students were most frequently in the North Central 
area.s. 

1 hrcc t> pes ol comparisons were made; 

1. Local, Follow Through students were compared with, 
non-FolUnv Through students at local sites. 

2. Poo/ccL Follow 'I'hrough students were compared with 
pooled or total non-Follow Through students. 

3. Xorm rc/crcncvcj. Follow Through students were com- 
pared with standardized test norms. 

I hcse comparisons were designed to answer these ques- 
tions: 

1. Does Follow Thrcnigh have a greater educational effect 
on disadvantaged children than does the regular school program? 

2. Do the various models ha\e differential effects on the 
education of disadvantaged children? 

. 3. Which, if any of the models, show consistent effects 
o\er time? 

Underlying the questions were these hypotheses: 

1. Cultural deprivation. In contrast to middle class chil- 
dren, educationally disadvantaged children do not receive as 
much formal or informal preschool education. Con.scquently the 
cognitive and affectiv e aspects of development are adversely af- 
fected. Since these aspects of development underlie educational 
achievement, their school performance is also inadequate. Some 
Follow Through models focused on compensating for this early 
deprivation by attempting to accelcrate.the rate of cognitive and 
affective development of Follow Through children. 

2. Failure of schooling. The relatively low achievement of 
educationally disadvantaged children is attributable to teachers 
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.vho l^^.vc knv cxpccialions tor poor children and conscquuul> 
p -ihc cnrr,culnn. downward. cMnbil negative aimndes 
ow^ d these eh.ldren. and use inetteetive nistruet.onal proee- 

Tr a d ehn.qnes in teaeh.ng then. The antidote ,s better 

u"el e s and tead.ng wth higher aeademie expectations mor. 
v^^enuu.c .nstrnetion wnh a ehalleng.ng -d stn.n a g 

■a.rr.cuU.nu and a .nore lavorable atftnde towards d.sadNan- 

fio.vi I'liiklrcn in ilic classroom. , 

' I he results of the study are yoUuninons. I ortnnately they 
have been summarized in three charts. Figure 2 dep.c-ts the aver- 

g cttects on Basic Sk.Hs. Figure 3 on Cognmve-Concep u 
st lis and Figure 4 on Atteetive Outcomes. 1 hese graphs wer. 
''nsmlted b determining whether the Follow ^^^^P 

■,, particular site had a significant positive oi negame ctkc on 
n o instruments as compared with its matched non-Fol ow 
V^; ough group. A positive or negative e.lect s.gndican i t 
ciiftcrence between the two groups on the .J' 
o luartcr of a standard deviation dilference. I his d.lteren e 
.s "b ut equal to two months in grade placement on a .s.andard- 
r.;-d test Vhen an average effect was con.puted lor each site ac- 
cording to the following iormuia: 

Average effect =NurrU^e^^ 

Voial number of effects 

The results were then plotted to make the bar graph. F"-" "^""P'^. 
Fiuurc ^ shows that the Direct Instruction model for Follow 
Zugh children on the Basic Skills category had more sues in 
^u"h there were positive than negative eiiects. 1 
nntunitudc of positive and negative cltccts. as ;'^ci.c.Ued b ted 
on the eraph for the Direct instruction model, is about 0 4. In 
h words when compared witn non-Follow Through classes 
^ s vIricH. sites, the bircct Instruction model came out more 
oft n o h positive than on the negative side ol the comparison. 

Howe'-r. note that :hcre was considerable variation lor ea.h 

model 
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Figure 2 

Ranges of Site Average Effects on Basic Skills 
for Thirteen Follow Through Models'^ 



mi 



— I— 



Mucit InMtiKtiKn 
f-lorul.i rJicDt I JiK.il.ii M.itlcl 
Hcli,iM«it AiialsMN 
M.ithcinJgvnn, Vctisihcs 

Miu • .S«MHh>^cst I JiK,jtii)n I )c\cl(iprticnt J.jh 
(■'.ililorni'i I'riKCNN 
Ri'sptmM^c- I ilusJtiiiTt Mo«)ci 
liilcriJt:p:"»li-nt I e.irniti): 
U ttik Siicct I olleijc Moilel 

("onniti\r Curriculum 

ni» . tducjiion I)c\cli>pment Center 
M I'ranmatiC-Oricntcd Mi'UcI) 

tn f - lfn)i\idi»<ili/cd Farlv 
I e.irninp Pribram 



• (Kcwil A>cr.ii«? r^fm lur the Model 



IUmc Skills Model 
I^.J Cn^nni^c Cnnk-cptiJ.U Model 

["*) ■Mlci ii^e Cii>fiiiti\c Model 



Figure 2 also shows that the Direct Instruction model 
made on the average greater gains in Basic Skills when Follow 
Through students were compared with non-Follow Through at 
its sites than did any of the other models, as shown by a compari- 
son of the dots in each graph. The graphs in Figures 3and 4areto 
be read in the same way. Kote that on Figure 3, Cognitive Con- 
ceptual Skills, Direct Instruction came out 5econd;and on Figure 
4, Affective Outcomes, Direct Instruction was again fir^t in com- 
parison with the other models, 

"Stebbins, L.B. ct aL An evaluation of Follow Through, Vol. IV A. In R.B. Anderson 
(Project Director), Educaiion as expf^fimerUaUon: A planned variation nwdel (rinal 
Report). U.S. Office of Education, I977. 
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Figure 3 

Ranges of Site Average Effects on Cognitive Conceptual Skills 
for Thirteen Follow Through Models'^ 



ESI 



I iitfit l(i<irut.tiiH) 

«n.( ■ Soiithv.€vi I .luv4ii.ni |)i'«.<:l..pmcni 1 .1 

\ l.it.it.i I'Atrnl I .Uua^rt M.-Oci 

I nitnlrpfiutcfH 1 i-artiinv 

H.,1.1. MTCfl I clknc M...Jci 
liKliM.Jtiali/cd 1 .itU I o.itdir.i; rt<.L;r..tti 

( li.t.VtK.I 1't. 

IlK I .t.:...l!li'TI I >i'\t()>fMIlCHI l^-'UvT 

I,. I't.ii:ni.iriv<)rk-»iu-.l M.>«U-1) 



• Om-mII 1 iit-a l..t ihc M. 

( I I .)j;iini\i.- l'.irKepii..il M.kU-1 
{ I Atfcklu-; t .»vfiiUM- M.»iU-l 



A succinct summary of the results which rcfiect "clear and 
general patterns" has been prepared by Anderson (1977, pp. 
XXiy-XXIX). Since we find their summary does agree with the 
data we shall follow it in presenting the results: 

Vuriahilitv among Models (Fii^ures 2. J. 4) 

Xfodels varied in their effeeiiveness from site to site. 
In some sites Follow- I hrough models had more positive 

than negative effects while in some other sites non-FolIow 

,.„vl,n>la H.S.cbh,ns>.-.al,.\nl^aluam.nolloll„u nimughA%>lunu.|VA InRicha^^^^ 
\UhIvI. l inal Rcp"". ' -S. OHkc iil l-ilucatuin. 
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Figure 4 

Kariijcs of Site Average I'ilTccls on Aricciivc Ouicomcs 
lor Thiriccri I-oHdw I hrDugh Models'" 



1 



J .t l,utr„.t,..M 



( .lti(.,ttli.i l'nM•^^ 

iMi- ltuliM.|w.i!(/r.l I .ith 
I (-.iiTiin^ I'lii^-r.irn 



r^r^|«ol..,^,■ I .li»..,1,.m M,,.'<) 
» .if'(liti\.' ( uirMil:,ti) 

<il- I iliii.ili.iri |)r\cN>pilli-iU ( VlH.-i 
I.I I'r.i.rti.iin. « >tii-|iU'«l \fo«Jrl> 

iMM rm>..ti I .,t!> I .|.,..,i,..n M<».kl 

\l.Mh,,.,.,^.,-|l„ \., o 

ItiUii)..-fi,Ti(lfiii I ..-juiiHf; 



• I l^ci.ilt ^^CI.|^;^■ f Hal t*>( lln- MikIcI 
□ H.I. I. ^V.11^ M.Kli-l 
L-J t i>k:iiiliM' t .tf^cpiii.it \f,„|ol 



Through groups had supcMior pcrlbrmanccs. No model was suc- 
cesslul in every site nor did any mode! Fail in every site on Basic 
Skills and Cognitive-Conceptual Skills. However . seven models 
which stressed Affective Outcomes performed on the average 
more poorly in eacli type of assessment than did their comparison 
non-FolIow ihrough groups, as shown by the number of dots on 
the negative side of ilie bar graphs in I'igures 2, 3, and 4. 

Basic Skills Domain (Figure 2) 

i. Models thai Mnjss basic skills arc more cffcaivc in de- 
veloping^ these skills than other models w hich emphasize co^ni- 

•In I nul.i ir Sicbbins cl al. An l\ iilimtinn ut lollow I hroujih. \'(>Iunu-IV A In Kichifnl 
|{, AmlctAon MMnicLi I >irLLlor). dhu aium /.'xprnmrnfanon: A r/annnt \ anaiwn 
\f*Hh'l. l iii.H Kcpoii. r.S, OIlKol I iliiL'uMon. h;77. 
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3 , " ^o^^oUn, hypc„h.,i. gain, .uppor, from .h. 

W , iV . m ,cad,m, basic sk.ll., .hen m, ,hc 

1 ,1 Vhr,ni"l. siu-s ba«c skilk apparon.ly wore Mrc.socl 

> ii,nn„„ „„„ps '''^-v 

A t- live nuKlds M.avov.f. onc-.hirj ..I il.c s,K-sovvn CNo.a . 1 

h1 C ' t he cl'cctuc >nu,cdicnts .n tl,c Duvet Insfuc u, 

' w.^c not is effective as the niatenals and proeedures used 
• ::t;Xd :;f t;e a.uula,- o,- non-Fonow Through classes wuh 
uh.ch the Direct Instruction classes were compared. 

rrurniiive-Concrpiijal Domain (Fiiiure 3) 

^ ' Z on cacorv o/vM nas more sucress/.l than an- 
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thirteen models, control group (regular classroom) pupils made 
greater gains than did the experimental group children. 

Affcciivc Domain (Figure 4) 

Basic Skills Models procluccci performance in the aJTec- 
tive domain tha: was as high or higher than performance attained 
hy models that stress the a/'/ective domain. Apparently the Basic 
Skills Models did not achieve their gain,s at the expense of an ad- 
verse effect upon children's self concepts. An hypothesis which 
appears to be plausible is that as a consequence of improvement in 
, .basic skills in the primary grades, a child's self concept is en- 
hanced. Again it should be noted that for seven of the thirteen 
models, control group children performed higher on the affective 
outcomes measures than did the experimental group children. 

Conch^sions 

We have briefly reviewed the Coleman Report which pro- 
vided a ground plan for the development ofcompcnsatory educa- 
tion programs following the 1954 Supreme Court decision on 
equality of educational opportunity. We have also reviewed the 
two ma jor federally-funded approaches to compensatory educa- 
tion: Project Head Start, and Project Follow Through. We have 
looked at these recommendatjons and projects in light of practi- 
cal school programs, such as the voluntary desegregation pro- 
gram in the Riverside Unified School District, and the Durkin 
precocious reader stud>\ with an eye to arri\'ing at conclusions 
which might guide the de\*elopment of instructional programs in 
public .schools. Recognizing the limitations in the data, we be- 
lieve the following conclusions are warranted. 

The preeminent conclusion which may be drawn from the 
Project Follow Through evaluation is that the ease of improving 
elementary education in the United States has been grossly un- 
derestimated by the United States Office of Education, the Con- 
gress, and most of the thirteen groups of curriculum developers 
and researchers responsible for the experimental programs tested 
in the project. Control group classrooms in the preponderance of 
comparisons scored higher on the dependent variables than did 
the experimental cla.ssrooms. They scored higher on basic skills 
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than nine of the thirteen model groups; they scored higher on 
Cognitive-Conceptual skills than nine of the thirteen model 
groups; they scored higher on affective outcomes than seven of 
the thirteen model groups. Regular classroom teachers teaching 
the regular curriculum of their school, without the benefit ol the 
additional millions of dollars of federal funds.outperformed the 
experimental group's in all outcome categories. 

;f'erhaps these findings support the belief that common 
school practice in the United States today represents the collec- 
tive vvi.sdom from more than one hundred years of pedagogical 
experience, and improvement of educational outcomes will de- 
pend more upon outcome from speeifie instruction than upon 
general approaches to general objectives. 

A second conclusion from this review is that if you want 
pupils to learn a particular skill or knowledge it is more efficient 
to teaeh it directly than to e.xpect it to transfer from other learn- 
ings As a group,'the Basic Skills Model programs (or Direct In- 
struction model) were superior to either of the other two groups 
on the basic skills measures. The program in this general model 
taught the specific skills required for word analysis and word rec- 
ognition and the pupils learned these skills better than they were 
learned bv pupils in the other model programs. 

Third, though the Affective/ Cognitive Model programs 
were directed more toward achieving affective outcomes, the Ba- 
sic Skills Model programs enjoyed their greatest superiority on 
these dependent variables. Only in this area were the Basic Skills 
approaches consistently superior to the control group ap- 
proaches as well as being superior to all but one of the other mod- 
el programs. It seems reasonable that improve! performance on 
affective outcome measures will be directly related to increased 
achievement in cognitive areas when programs focus upon the 
developmeiit of cognitive skills and knowledge. 

A substantial part of the life of the child is spent in .school. 
!n the school setting, parents pri/e academic achievement, teach- 
ers demand academic achievement, and pupils reali/e that some- 
how their status is dependent upon academic achievement. Basic 
skills instruction secures the greatest academic growth, and fos- 
ters high affective outcomes. 
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How to Recognize a Reading 
Comprehension Program: An Afterword 



John r. (iuihric 

i/nn'rnaiional Rcadifiy, Assin ianun 

Wc have enormous powers of recognition. For most people, it is 
nothing to rceogni/e a hospital, a farm, a game of chess, an oak 
tree, or a reading program. We can distinguish a hpspital from a 
bank, a farm from a hill, chess from football, a tree from a bush, 
and a reading program from a concert. Wc possess little 
awareness, and little need for awareness, about what we retain in 
our mental structures that allows us to make these perceptions 
rapidly, automatically, and thousands of times a day. 

' When it comes to reading programs, however, profession- 
als in the field need to be able to reliably go beyond the correct 
identification. A bird watcher may correctly identify a thrush, a 
robin, or a stilt without being able to name their unique features. 
Yet, the difference between the ornithologist and the amateur is 
that the ornithologist ha.s a store of knowledge that can be 
communicated to other people which will enable them to 
discriminate these species from each other and to describe the 
biological difference between birds and non-birds. 

The challenge for .students of reading is not to recognize a 
reading comprehension program when it is presented, nor to 
have the concept generally developed in the head, but, rather, to 
e.xternali/e the knowledge in a way that will allow another person 
to make the recognition and to distinguish a good example from 
a poor one. 
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Although the exercise of rceoi/!)ni|^^^) is ••^''^'^'^'ius n^^^ 
comnuinieatingabtuil readiim proufani.'^ .^Uk^^^ insi^'''*^^' I„..t"^^^ 
tulequate lor otiier purposes, por HU^ h)^^ p^^^ . 
programs tliat eurrentiv exist reui^'''^*\lM» 'i ^wic't-' '"^^ ,,n, 
inertia, l-.diication ol reading professioi' at^ ^^^i^.^s .,actice 
ancc of knowledge, a.s well a.s the 'ri;'^ . ' ^^,o^ ogram.s 
teacliuig. 1 he improv ement of rL-^i^^ii' M^.j^jtiri .^j^j, . 
demands criteria orcxcellcncc. rather ^i^^ ^^riii'^'^ ^^^^ 
lasliion. ,1 j- 

lo construct a system thiit ^ni^'' 
program. \vc may ask about thegcnerally^ ^vd '^^''^"^ on the 
reading program/* Typically. dcrini^'^^^|V I^^' cc'i^'^^^^jporied 
materials children arc expected to rc'^^^- ,|i q ^> Cli^" ^'ffer^ 
her ''critictd analysis of existing rc^^^'^^^^lie ,^^>^p,jfinlr ^part^cMi 

I reading ^( p. -^)' ;r"W^.^^J'd vst Tu 



approaches to beginning reading'^ (p. \fiii:^^k^>^c(i osi 



and Wilder (1964) that. -Readini^ i^ '''!^gradcd 



schot)ls starts IVom a similar basis: t^^^"^** k\ fr^^^ to 94 
series arc used by 9S percent of I'irst i^ra^. ot^'^^S^'^f-s jays of 
percent of second and third grade Xcii^^^^^'i in|sMj ^^fH^^^'*^^ .V^^. 
the year*; ( p. 188). Due to the domin^itin^t) cj^M^j^^i: ^'^'fthcm 
reading series. Chall went to some le^i^th i^^ ''^^pit)t* 
and gave the following definition of ^^^^ k^o^C*^^' 

■\ hiisal rciidinLt series. ..;uicnipis [O yiv-iv^"" ^Ob^^ ^^^^j P^^'^-j,^ (in ih^, 

I'jiichcrs* matuials). a collodion or'st^^ri^-^^i'I '^"^^^K^onk) 
rcaddhc rL^ulcrs).;indL\\crciscsror:id^l"»' ;vP''*^^^Vv^j^.^.(lh''*^^,^ onhrco 
riic scries iicncrally siaris wii h a prcri-*;» ' ,„fi^'; ^"^'^[yd TlK-n 
-rcacliiiy readiness** hooks lor kirult''''i^i*^ - ' "^<M Mr Jf'* rf^l ^^oks 
coMK- thc-*iiradcd- readers: ihrcc mo^ Jirs^ t^-J('''''^\ hOokuu^ 
(the prepriiiiers). whieh are followed tli''*-^^' 
primer), and then a firsi reader (i^e •-^/"iK* >.-pi''**'^*V- Jj'^^^^iigh 
(si)nienmes six) hooks are useil hy c*M»'il''V^^' '"^lu^^ p,' jciil basal 
some advance ihrouLth iheni lasiJr f'^^* ^'^ h^*-"' ^md 

reading scries has a hook lor caeh h:!^* ^^*' j'^^* ^K^^ .irf^''* ^ ^^^^^^ 
2-2 hoOlvS, Similar hreaks are Uiad^\'"'^''j|^'''.^V)^^j ihr^^^'^jL^h gHides^' 
tirades. .S^iie companies have readcr> i|i;ifi^''*\^\Viihc'aeh 
orelementaV^ school or ihe ninlh ^rud«^ ^''y^'*'' ^^h-'^'-**^^'' • ^""^^^^ 
• reader .uoes workhook a ••cori^LiiiVj.^'i'O^,^ ^^Xp'-'^''''*''^ ''''^^''^^'Ii^t's 
containinu a variety of exercises. i;^ v^i'o I'*'-\^*jers and 

tiiii(.Iehook (a manual on how [O ^'''\t^^^^\\^ ' i\)C ^is. lesis 
work hooks). .Schoiils can also purc*'^*'-'''' ' ^v^j^vif'^ 
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other aids will, each scr.cs. In aclclituu,. some publislKMs otT.r 
supplementary readers and suhjeet tnatter textbooks eoordmated with 
the hasal reading program (p. ISS). 

-I his emphasis on materials was not altered when Cliall 
interviewed the authors of major basal systems prominent at that 
time Most authors recommended that the program be 
supplemented bv related reading from the library, children s 
literature anthologies, and textbooks. Otherwise, elaborations 
on what a reading program might be were not given. 

.Another illustration of the prevailing orientation to 
materials is given by Bond and Dykstra (1967) in their 
description of the reading programs that formed the ba.sis ol the 
first grade reading studies. They noted that 

.\ lonrth treatment uroup was labeled Language H.Nperienee. ..\ basie 
dement ot this instrnetional method is that the ehdd s own wr,t,ng 
serves as a medium ol instruetion. I hc ehiUPs first stones are dietatcd to 
h eaeher who aets as the reeorder. As soon as he ,s able the pupd 
writes his own stories and shares them with the teacher, ^^-nng the 
„ulividL.al eonlerenees between pupil and teaeher. he ,s helped to 
reeouni/e the eommonality between the words he writes and speaks and 
he develops the skills ireeessary lor reading, I hts approach, thu . 
ordinarilv utilizes far fewer hi/i(>ly structured instructional matena s 
, n do most instructional programs, In.addit.on. voeabulury control s 
i wed as beinu in the language ttself and in the language backsround of 
each child. The pupil learns to read the words necessary lor htm to use n 
uritinu. One of the major instructional tasks in this tm-thod is to 
engender a stimulating language environment (p. 46). 

1 n this description, the content to be read by the child (the child's 
dictat.^d materials) is emphasi/.ed. The content is the dominant 
feature of this reading approach according to these authors. 

The astonishing fact about these two delimtions is that 
thev almost totally ignore the student ^vs a learner. It is as though 
•I readina program could somehoW e.xist independent of the 
children expected to learn from it. If teachers in an elementary 
school in the U nited States arc asked what reading program they 
use their replies are likely to be the names of such and such 
materials usuallv 'basal reading .series. It goes without saying 
that these materials are not simply warehoused or used as 
doorstops: thev are processed by children. Although children 
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lairn from the nuilLM ials, this learning is so inherent iluii it often 
goes without iiotiee or desei^inion. Yet a reading program is 
more than i\ set ol" materials or books or stories or exereises, A 
reading program is n eollection ol eontents to be read, studied, 
pondered, answered, and learned. 

More reeervtly.. the eoneept of a reading program has 
included statements of objeetives about what ehildren are 
expeeted to letirn. For extimple, a eontemporary reading 
program author states in the teaeher\s edition that seven skill 
aretis pernietite the progrtim. These inelude: 1) decoding, which 
''nuiy be viewed i\s the coriversion of written symbols into the 
sounds they represent'^ 2) comprehension, "the ability to grasp 
the meaning or metmings ofcomnuinication'^ }) language, which 
relers "to the child\s production of langutige and his her 
undcrstandingof l:jnguagc itseir; 4) research and study skills, 
whieii includes "finding and using various kinds of information": 
5) literature and the writer's crtift, which is "conce.ned with 
understanding different kinds of literature, distinguishing 
between ftict and fiction, tipprccititing tin authors style, etc.'': 6) 
attitude towtird self in residing, which is the way children react to 
reading on a personal basis; and 7) creativity, which is the 
cncourtigcmcnt of new. different, and unusual responses. 
.Although these objectives nuiy or may not be accomplished and 
the new materials mtiy or nniy not bcsuitable forthem, they stand 
as goals which tire rcltited to student Ictirning and comprise the 
intention of the program (Ruddell & Crews.^ 1978). 

A recent, extensi'-c tuKilysis of the Houghton MilTlinand 
(iinn 720 Series (Beck, McKeouen. McCaslin.*& Burkes, 1979), 
sttites that these series provide "tottil instructional systems," 
including the pedagogy lor teaching selections for ehildren to 
rctid tmd prticticc exercises. Beck, et ah claim the programs "exhibit 
similarly broad'.orientations to reading, which include decodinu, 
litertil and inlercniial comprehension, and enrichment skills" (p. 
5). They point out that most imits ( in other words, the nature of 
the program) are conditioned by the nature of the objectives for 
learning thai accompany the matcriafs and arc embodied in 
.them. 
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The recent addition of objectives to the concept of a 
reading program is still not sufficient to represent what people 
really \mean by a reading program. If a teacher stood in a 
classroom equipped with a basal rcadingserics, and there were no 
children in the classroom for one year, one would hardly say that 
a reading program had occurred. Likewise, if the children went to 
the classroom daily but never looked at the available basal 
materials nor any other reading materials, one would hardly say 
that a reading program had been experienced by the students. II 
the children looked at the materials but paid no attention in the 
extreme or found them to be written in a foreign language, one 
would not say the reading program had taken place. In the end, 
some learning mu.st occur for at least some children to allow the 
inference that a reading program is being conducted in a 
classroom. Otherwise, no . program exists. 

We have seen that processes of learning in children are 
intrinsic to the body of a reading program. These processes may 
include attention, visual and auditory perception, language 
comprehension, linguistic differentiation, understanding word 
meanings, and written communications. I'he point is that to see 
or describe or account for a reading program is to contend not 
only with the materials that are intended to produce learning but 
to capture the learning or information processing or reading 
comprehension that is intended to occur as a response to the 
materials provided. Without describing the kinds and amounts of 
comprehension that occur, we can t affirm that a program exists 
at all, much less whether it has been implemented as it was 
intended. To recognize a reading program, one must be able to 
recogni/e reading comprehension and its improvements in the 
learner. 

The present book contains several chapters of research on 
reading in the 1970s that are directed to describing the processes 
of reading comprehension. For example, Rumelhart illustrates 
that a schema is a representation of knowledge. In his network 
arc included units of information with relationships am.ongthefti 
in a certain pattern. From a schema, one constructs an 
interpretation of incoming information from written materials. 
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'riiese schemata, thcMi, tietermine wluit can be understood during 
the process of reading. In addition, schemata faeilitate the recall 
Of stories and text materials. If a person has a well-developed 
network of relationships in memory for a particular content area, 
then what one reads that is related to the schema will be clearly 
retrievable for relatively long periods of time. 

To recognize that reading comprehension is occurring in a 
classroom lor a child, we must know the type and rerincment of 
schemata possessed by the child, the type of text being read by the 
child, and the likely interaction between the two. If the child 
answers correctly a question over a paragraph, it is pos,sjblc that 
comprehension did not occur because the chile! already possessed 
the information in the form of background knowledge. It is 
po.ssible that comprehension did not occur because the child 
. answered the question "rotcly" from the text. Furthermore, if the 
child fails to answcrcorrectly, it is possible that it is not a pr-oblem 
of comprehension but rather one of insufficient background 
iiiformaiion, inappropriate vocabulary in the text, inadequate 
attention to the text, or unwillingness to provide an answer that is 
available in the child's response ,systcm. Therefore, to recognize 
comprehension we must perceive a correct answer to a question 
over material that was read by the child that could not have been 
ans\Vered from background knowledge or verbatim recall of the 
material in the text. To detect improvements in these abilities is to 
perceive correct answers to questions over text in which the text 
has increasingly complex language structures and the knowledge 
base of the child is larger in quantity and more complexly related 
to the text. 

In the present volume, Pearson and Camperell hayc 
illustrated that children possess not only schemata for content of 
written materials, but, also, schemata forstructure. For instance, 
a child's story is likely to have a set of syntactic relationships at 
the level of the story consistent with many other narratives, 
Pearson suggests that a schema for stories can be taught to 
elementary school children. If this is true, then one may recognize* 
the acquisition of com.prchension in a classroom if a child's 
schema for narrative is improving. A "better'' schema is one that 
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contains more adultlik^' constituents and their relatiopships. An 
increasingly complex scli^'nui allows children to understand and 
remember i"^'rcasingly ^^^^"^P'^^x narratives and to identify finer 
distinctions withj^^ thcni- with respect to the plot, its episodes, 
characters, reso^^j^^n o( conflict, and attitude of the writer. 

One proeoss necessary to comprehension is that of making 
inferences. As 1>abasso (in this book) has shown, inferencing 
allows the reader to connect the text in such a manner as to render 
a plausible intcrprctati<^" ^^^^ its meaning. Without a host of 
inferences, most text ii> ^I'^i^'iicnted to the point of incoherence. 
There are scvcr-ii typ^'-"^ inference causal, informational, 
temporai ^'^'aluativc, niotivational, and so on, Trabasso 
emphasizes that success in answering caasal inference questions 
is crucial to story eomp^'^'^^'"^'"". In his chapter there is evidence 
that children \vi\^^ .^^e better comprehenders are more likely to 
answer causal miestion-"^ correctly than arc children who are 
inferior in this process/f^'^ implies that one aspect of improving 
comprehension \^ tlK i^piovemcnt of inferences in terms of 
I rcquency, ^''^^^lpl^.xity^ ''^^^ appropriateness to the material lor 
which suitable schemata are available. 

Althougi^ word n^canings are'highly correlated to reading 
comprehension a substantial number of investigations have 
been made inih ^^ic relationship (over 3,000 since 1972 by one 
count), the reujjon for the association is not clear. As Anderson 
and Freebody suggest may be that these Variables arc 

correlated because word meanings are part of the task of reading 
comprehension. Anoth^^^ possibility is that vocabulary is a 
measure of aptitude f^^^ which comprehension is another 
indicator It is possible that vocabulary assesses the knowledge 
base that, is indispensable for comprehending a paragraph written 
on that topic. 

Equally tnvstcrious, according to Anderson, is just what it 
means to "kno^'d^^ nicaning of a word." To simply know its 
denotative meaning v^i" inadequate for many purposes, and 
yet it is not possible young children to know its subtle 
connotations that may ^^"^^ ^"^^t in a variety of contexts. The si/.e 
of a student's ora| vocabulary involves both depth and breadth of 
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wok! knowledge in a complex relationship. Despite the 
pliilosopliieal eoinplexily and empirical uncertainly in this tirea, 
it seems jiistilied lo recogni/e improvemenl in reading compre- 
hension when children's word meanings increase in mimherand 
complexity, and words thai previously rendered i\ sentence or 
phrase iiicomprchcnsible do not present the same obslacle at a 
later poinl in time. ■ 
There is one definilion of a reading program lhat focuses 
on the processes of comprehension and learning, StaulTer( 1969) 
states that 

I he ob}ccii \ c i>r i:roiip ( liircctcii rciuiiiiy leaching activity) iraininy is to 
dcvchif) .skill in critical reading. ID be a critical reader requires a 
coiiiniaiul or three iiUelleetually rcliiicd skills. I he first .skill is the art of 
inquiry, or of askii^g relevant nucstions. Allied with this is the ability to 
cDnjectnrc. estimate, hypothesi/e. A scholar doe.s not raise questions in 
an intellectual \acunni: rather by virtue of the knowledge and 
experience available to him at t lie time, he eonjceturcs about answers.' If 
he accept.s the ipiesiions that .someone else raises, it is even greater 
si^nilieaiKc th.at he .speculate about an.swers. I he second skills required 
IS that of proces.sing information, (he scholar does this in a 
[ircseribed. not random, order, as dictated by his reading ^loals. In many 
uays. the human mind acts like a computer if prnperlv programmed 
( inquiiA-reuulated. goal-set) data encountered arc proces.sed appropri- 
ately (as when eviilence i.s weighed selectively), i he third skill is lhat of 
valulating answers. This the scholar does either by testing his judgment 
again.st that of a group or b_v actually trying the answers to see if they are 
correct. I hus. in brief, the critical reader first feeds forward, or inquires, 
then he processes selectively, and finally he gets feedback by testing 
answers (p. 35). 

In this view, it is not the materials nor their objectives that 
are the defining quality but. rather, the processes performed by 
the learner. The teacher's responsibility is to provide written 
stimulation and .sufficient interaction with children to enable, 
these cognitive accomplishments to take place. 

A natural question at this point is, "What characteristics 
ol the reading materials will set into motion comprehension 
processes and generate the conditions for their improvement?'' 
An attempt to answer this was made by Jenkins and Pany(inthis 
volume). A survey of the experimental literature in instructional 
psychology revealed that the following characteri,stics have been 
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illiistrateci to improve comprehension lor written materials: 
appropriate headings placed in expository text, pictures that give 
supplemental information to adjunct niatcriak speed and fluency 
of word recognition, instruction in word meanings that are 
explicitly required by sentences, questjc^j/s in the text that are 
directly related to infornKition to be learned, and mental, 
elaborations on what is comprehended from reading. 

The strategy employed by .Jenkins and I*any in their 
review was to locate variables that have been found to correlate 
with comprehension tmd examine whether improving perfor- 
mance on these variables through intervention increases 
comprehension ability measured in another, appropriate form. 
Such synthesizing strategy is painstaking and requires sophistica- 
tion in experimental methodology. However, the findings arc 
well rooted in a conceptual base of instructional psychology and 
mav serve as benchmarks for promising educational materials. A 
reading program that contains them is far more likely to evoke 
constructive comprehension processes than one in which they are 
missing or sparsely scattered. 

In the reading program definitions given by ChalL Bond 
and Dykstra. Beck, and the authors of major published 
curriculum series, little is said about quantity of instruction. Yet, 
suppose a teacher had a desirable set of materials well suited to 
the children but only engaged the children in a lesson over 
reading for one day of the school year. In practical terms, the 
school that teaches math 179 days and reading for 1 day doesn't 
have a reading program. Yet, what jbout 2 days of reading 
instruction? Is that enough? How aboui 2 months? How many 
"hours'of reading activity is needed to conclude that a reading 
program has taken place within a school? This overwhelmingly 
obvious point has been neglected in the '50s/6()s, and '70s in the 
quest for an optimal method. What is being asked now is, "What 
is the optimal amount?" 

Berliner introduces the concept of academic engaged 
learning time to describe the quantity of instruction. It is based 
on the simple fact that the best predictor of reading achievement 
is the amount of time spent reading during the course of the 
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school year. This variable has been idcnliried by Harris and 
Serw'er(l%6) and highlighted by Rosenshine ( I97S), but it has been 
experimentally investigated by Berliner. He reports here that the 
range in amount of time spent engaged with materials oi' an 
appropriate level is broader than one might suppose; from 24 to 
5^) minutes a tlay, whieh is from 60 to I4S hours per year spent in 
reading instrueiion by ehildren. Berliner also illustrates that il' 
ehildren read niaterials which are appropriate in dilTieulty ioras 
little as 4 minutes a clay lor a period ofS weeks in thesehoolyear, 
their reading aehievement will not change, which means their 
percentile rank will decrease by 10. In his study, if children en- 
gaged in reading 23 minutes a day for 5 weeks, their percentile 
rank stayed the same. And if 52 minutes a day were spent in 
appropriate reading activities, an increase of 15 percentile points 
was observed in a 5-week period. What these dath point to is that 
to reeogni/e a reading program, one must measui - its quantity. 

Bu hner's analysis underscores the fact that the amount of 
a reading program is indexed by the amount of time children 
spend learning, rather than the amount of time allocated by the 
teacher or the amount of time the materials may be available in 
the library for use or the amount of time teachers may spend 
planning. It is noteworthy that for many reading educators. 
Sustained Silent Reading (ssu) has been hailed as a break- 
through. Across the country, teachers have taken time out of the 
reading program for children to read. When reading is regarded 
as an interruption, oi an innovation, it suggests that the program 
of educating children to read has been widely off the mark. 
.Although the materials and the processes they engage must be 
high caliber, the amount oi* engagement must be high for a 
reading program to be more than a name. 

Although we have developed considerable sophistication 
about comprehension processes and children's interaction with 
materials, we are not yet close to a concept of a reading program 
that is sufficient to recogni/c one. We have not ruled out the child 
in the haystack reading a book (perhaps with adjunct questions) ' 
or the princess in the courtyard being privately tutored. While 
these may be the occasion for the acquisition of reading, they are 



322 



Guthrie 



so unusual as to I'all outside the seope oT activities known as a 
reading program. Reading programs are found in seliools vvliieli 
are large, eomplex institutions with many sorts of indivitluais 
fulfilling a diversity of roles. It stands to reason that the 
eharaeteri.stics of the institution, the individuals who oeeupy it, 
and the relationships among groups within it should condition 
the reading program. 

Despite the unavoititible, social context in which learning 
to read occurs, the social structure in which reading programs are 
emhedtled has been excluded from educators' concepts of the 
programs. I 'or example, the Right to Read.systcm contracted the 
American Institute lor Research to identify exemplary reading 
programs. A typical example of the 12 that were found to meet 
preestablished criteria was a Title 1 reading center program in 
Broward County. Florida, described as follows: 

Stiiilcnts t ri)ni low income lamilios who urc lunclioniiig ;ippr().\iniatcly 
2 years below grade level in reading: in grades 1-6 eonie to tlK' main 
Keailing Center or one ol the. II sehool-hased centers Cor I hour ol" 
instriietion every other day (a total o\ 5 hours every 2 weeks). Hased on 
the results ol' stall-developed diagnostic tests, teachers at the centers 
prescribe an indiv idualized program of learning and rcinloreement 
activities lor each child. A child spends part ol each nour receiving 
direct instruction Irom the (cacher in skills development and the 
remainder ol the hour in rcinhuvcnicnt activities under the supervision 
ol an aide. .Stall ot the Research Department ol Hroward County 
Schools work in close cooperation with program managers and 
mstructors to determine the extent to which program objectives are 
realized, and to identify changes that should be made in tlie program 
(liowers. Campeau. Roberts. 1974. p. 62). 

The United States Office of Education has attempted to 
compile and disseminate "educational programs that work." The 
Far West Laboratory for Educational Research and Develop- 
ment issued in the fall of 1978, a publication containing 
descriptions of these programs. One program, entitled "Learning 
to Read by Reading." took place in California and was claimed to 
be "a unique method of teaching reading to sub-par achievers at 
upper elementary through junior college levels who have failed to 
progress through the use of conventional methods and materials" 
(pp. 7-42). A description is given, including the kinds of symbols 
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used, tlK- iLMigth of stories, the number of books read orally, and 
the number of plays enaeted during the eourse of the program. 
No statement was made about the social context, its structure, or 
the personal characteristics of the participants in the program. 

We can be grateful that sociologists and anthropologists, 
who arc not so immersed in reading programs as to make 
assumptions about their context, have reminded educators ol the 
dcuree to which classrooms and schools may be regarded as 
minisocieties. For example, in a study led by anthropologist Jules 
Henry, innereity schools were observed by a team of investigators 
(Ciouldncr. 1 978). They observed that 

Sclioolirii: was .in cxponcricclliai sccnia! lo rcwaiil liiosc LliiUlicii who 
lunsi icailiU vonturmcii \n the social onlor cMahlislial b\ llic scluu)! 
huivat.Liacs 1 he sclmnU wax rnoiv hcaiabk* places toi iliosc ciiiUircn 
who allowed ihcii rmiiMiluaiiiy arul aiilnru)iii\ > be shaped lo the 
corisira.iUs ot the classiooni and the rest nt ihe ir. liintioii. 

Some ot the chtivlrcn seenied to respmui \u. 'c willini-ly iliati 
oiluTs lo the ailinoinlioiis ptinted acrov. the lop o\ liie powers 
(kcoialiriii the tmrii wall t>t one liisi iirade classroom: **Sii Nice and 
Straii:hi'*"l)on*i 1 alk." **R:iise Vour Hand/* --non't C*hcw (;um.''''I)o 
Your Hesl Work." "Take Care of VourscHV* "lie Happy." "Snule. 
AUhonuh :hese sloiians were not necessarily followed all the lime by ihe 
liiiih achievers who\\ere the leacher's pels, they incorporaleil ijnitc well 
liow the teacher wished ihe students lo bcha\e most ot the tniic. I he 
siars panieipated m classroom activities. lolUnved directions well/and 
jia\e correct answers, and altht)ai!h ihcy were aiven much more I recjion? . 
io bL-ml the rnles than the rest ol the children, their reacher would ewrb 
them w hen ihev went loo Ian I he pets were adept at re.spondmii to iheir 
teacher's wavs ol" ct>ndiiclinii her elavs antl her need to keep order. On 
iheir own level. thcN p -essed an almosi oncannv iira.sp ot how the 
bureaucracy wiM ke 1 < p ■ ' ). 

Ca/den (this volume) suggests a number ol' social 
interaction variables that seem to be inHucntiaL For ^stance, the 
teacher's method of interacting is dilTcrcni with higher .achieving 
and lower achieving reading groups within the ela.ssrooni. \ he 
hiuhcr uroup receives more Irccdom. responsibility, and 
op^portunity lor scll-initiaied learning, fhe lower group is 
required to attend to questions and reading selections uniloi miy 
at the leachcfs ruquest. Children in the lower reading group arc 
interrupted, guided, and corrected more frcqucnlly than I hose of 
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the upper group (Allington, 1980). An analysis of these 
interruptions shows that they may inhibit, rather than enhance, 
learning. Doesn't this imply that because of the social structure 
created by the teacher in a classrcom, the children's opportunities 
for cognitive interaction with written materials are substantially 
altered? 

One dimension of the social structure is the group of 
children who are the learners. Their interactions with one 
another have been reported to be associated with learning by 
VValberg and his collaborators (this volume). They found that 
learning was positively associated with cohcsiveness (how well 
students know each other). But it was negatively associated with 
friction (whether students are responsible for petty quarrels). 
Other negatively associated variables relating to interactions 
include apathy (students don t care what the class does) and 
competitiveness. I'hey found parado.xically that difficulty of the 
work was positively associated with learning in high school but 
negatively associated with elementary school reading achieve- 
ment. The power of some of these variables is surprising. For 
instance, cohcsiveness and friction correlated to reading 
ai^hievement at the level of .80 and .90. It seems vitally important 
to incorporate social* goals into the reading program explicitly. 
To neglect them may be not only to subvert the academic 
intentions of the program, but also to condone a social system 
that produces dubious effects and consequences. 

Another girder of the social structure of a reading 
program is the administration. In the current volume, Samuels 
points out that elements that have been common to exemplary 
reading programs emerging from six nationwide surveys include: 
leadership, teacher training, specific reading objectives,' high 
intensity treatment, and efficient use of time. In addition, he 
noted that lack of leadership md inefficient use of time were 
common to unsuccessful efforts. These two variables seem to 
distinguish the remarkably effective programs from the appar- 
ently ineffective programs. The variable of leadership refers to 
enthusiasm, goal-setting, and organization of social and material 
relationships provided by a person in the reading program. It is 
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noteworthy that this social variable has been identified solely 
through studies that are extremely applied with the premium on 
locating and disseminating effective systems. Bodies of research 
that are theoretically based, such as the one summarized by 
Jenkins and F^any, and are related to instruction have not 
accented this feature of the reading program. 

The school is a unit in the social system of a community. 
Vet, the relationNhips between the school and the community 
have not been fu'lv examined by reading education researchers, 
save for the recognition that "socioeconomic statu.s'' of pupils is 
an overpowering determinant of school achievement. In the 
present volume, two authors point to a linkage between the child 
in the reading program and the community, which is through the 
oral language. Ca/den reports that in a highly effective Hawaiian 
reading program the cultural congruence of context for reading is 
high. 1 he reason for this is that the classroom 'Messon-talk'' is 
similar to the collaborative narrative of personal experience in 
Hawaii. In other words, ibrms of spoken interaction, which 
entail social systems that occur in the community, also prevail in 
ihe cH:*ssroom. As another insu^ncc, Elley illustrated thai a highly 
effective reading and language program in bilingual communities 
in the South Pacific was developed by reading stories to children 
in the second language that were highly familiar to them in their 
i'irst language. Although relationships between the school and its 
surrounding culture have not been articulated, they are implicit 
in our conceptii^n of a reading program. Tending and raising 
sheep is not part of the curriculum in most innercity American 
schools, although it is universal in rural schools in New Zealand 
which is a sheep raising country. Beyond the obvious, however, 
we have not expressed the values, knowledge, activities, and 
competencies important for the community that should be 
learned in schools fn)r examined the congruence between our 
aspirations and o\- r practices in this regard. 

A model a reading program that may be sul'i'icicnt 
might be dcscrTbcd in terms of several categories of variables: 
student learning and intcractiot]; reading materials: and a social 



326 



Guthrie 



system of teaching, administering, and community living. This 
book lias ioeused primarily on the processes of learning and 
comprehension that must occur to say that a reading program 
exists. Cognitive interactions between the learner and the 
materials have been described with an attempt to identify 
variables in the materials that facilitate comprehension. Interac- 
tions between students who exist within the circle of learning 
seem to be important for the cognitive accomplishments that are 
the most widely shared goals of the reading instructional system. 
The social context for learning- which consists of school and 
community personnel, their behaviors, and their relationships to 
the learner is crucial. Most teachers can recogni/e a program 
with these factors. Although researchers have succeeded in 
externalizing some important proees.ses nf comprehending, the 
available analysis of content is superficial, and our knowledge ol 
the social contcxi is primitive. 
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